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^ , Abstract 



This is a collection of lectures given at the University of Heidelberg. They are about Dirac's 
| general method to construct a quantum theory out of a classical theory, which has to be defined 

in terms of a Lagrangian. The classical Hamiltonian formalism is reviewed, with emphasis on the 
relation between constraints and gauge symmetries, and quantization is carried out without any 
kind of gauge fixing. The method is applied to three examples: the free electro-magnetic field, the 
relativistic point particle, and the very first steps of string theory are carried out. 

o ' 

O '■ 1 The Hamiltonian Method 

The best way to get a feeling of how the Dirac Programme works is to study some simple examples. 
Consider a set of non-relativistic particles, described by a set of coordinates q 1 , where i labels the 
particle number as well as the space coordinate, or think of the system as a set of particles in one 
space dimension. More generally, the q 1 can be thought of as coordinates on a manifold, which is called 
the configuration space Q. The classical motion of our system is given by a path in Q, i.e. a function 
q(t), where t is the time and q = (. . . , q l , . . .) denotes a point in Q. Such a path will be called a time 
evolution, or simply an evolution. 

The physical dynamics of our system is described by an action functional, which assigns a real 
number to each evolution. In "real life" only those evolutions are realized for which this action becomes 
extremal. What we need for our purpose is that the action is given as a time integral of a Lagrangian, 
which is a function of the q % and their velocities q 1 , i.e. 

I[q(t)}= J tit L(q(t),q(t)). (1.1) 

In principle, we could also consider Lagrangians which depend on higher derivatives of q or explicitly on 
t. For the former, it is possible to introduce additional variables, such that they can always be rewritten 



in the form (1.1). If the Lagrangian depends on t explicitly, the programme to be presented below works 
in the same way, except that certain functions to be introduced will become time-dependent as well. 
However, the most interesting examples to which we want to apply the programme are those where 
not only the Lagrangian is independent of t, but where even t does not denote the physical time (so a 
t-dependent Lagrangian wouldn't even make sense). But let us come to this later on and stick to t as 
the physical time for the moment. I'll leave it as an exercise to modify what follows such that it can be 
applied to time-dependent Lagrangians. Hence, ( |1.1|) will be the most general action to be considered 
here. 
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To be more precise, L is actually a function on the tangent bundle of Q, which means that it takes 
a point q £ Q and a vector v £ T q Q at q as arguments. Only after choosing a particular path q(t), we 
can set v(t) = q(t) and evaluate the integral ( |1.1| ). So, we shall in the following avoid the somewhat 
sloppy notation L(q,q) and write L(q,v) instead. A typical Lagrangian for a set of non-relativistic 
particles is then given by 

L(q,v) = lM ijV V -V(q), (1.2) 

where Mj,- is some "mass matrix" (e.g. Mij = m5ij for a collection of particles with the same mass or one 
particle in some higher dimensional space), and the Einstein convention is used to indicate summation 
over indices. We know how to derive the equations of motions from the Lagrangian. By varying the 
action with respect to the path q(t), we find that the time evolution has to satisfy the Euler Lagrange 
equations 

^t^ {q ^ m = W {q{tUm (L3) 

In general, these are second order differential equations for the function q(t), and a time evolution which 
satisfies these equations will be called a solution. 

The classical part of the Dirac programme is essentially to transform the Euler Lagrange equations 
into a standard form which can be used to classify their solutions. First of all, as for any set of differential 
equations, they can be rendered first order by introducing some auxiliary variables. In this case, we 
need one additional variable pi for each q 1 , and the most reasonable way to do this is to choose pi to 
be the term under the time derivative on the left hand side of Ql-3| ), which gives the set of first order 
equations 

dL dL 
Pi(t) = — i {q{t),q{t)), p i (t) = — i (q(t),q{t)). (1.4) 

If the configuration space is not a vector space, then the momenta p% have to form a covector at q, i.e. 
an element of the cotangent space T*Q. The time evolution is now given as a path in the cotangent 
bundle T*Q, which is called the phase space V. spanned by the coordinates q l and p%. A point in V 
will be called a state. For our non-relativistic particles we find 

Pi {t) = Mi^(t), Pi {t) = ~ (q(t)). (1.5) 

If the mass matrix is diagonal, or if there is at least an inverse satisfying M^Mj^ = S\, we get 

q i (t)=M i *p j (t), Pl (t) = -^-{q(t)). (1.6) 

This is a set of first order differential equation in the standard form x(t) = F(x(t)), where x{t) G V 
is the state evolving in time. We can start with any initial state xq = (qo,Po) at, say, t = 0, and 
get a corresponding solution of the equations of motion by integrating ( |1.6| ). Ignoring the problem of 
singularities, there is a one-to-one correspondence between points in phase space and solutions to the 
Euler-Lagrange equations. 

Let us now come back to the general equations of motion ( |1.4[ ). Can we rewrite them in the standard 
form x(t) = F(x(t)) as well? An elegant way to answer this question is to get back to the action principle 
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(1.1). In addition to the momenta, we also introduce velocities v l as auxiliary variables, and define a 



new action 

I[q,p,v} = Jdt tfpi-py + L(q,v), (1.7) 

where q(t),p(t),v(t) are to be treated as independent functions of the time t. Variation with respect 
to p and v gives 

dL 

v l = q\ pi = — ( q , v ). (1.8) 



Reinserting this into the action takes us back to (1.1), together with the correct definition for the 



momenta. So the new first order variational principle is equivalent to the former. To obtain a set of 
first order differential equations in standard form out of ( |1.7| ), all we have to do is to get rid of the v 
variables therein. They are not very interesting anyway, because, for any solution, they are nothing but 
the time derivative of q. Let's try the following. On the phase space, we introduce the Hamiltonian 

H(q,p)=Ext v ( Pi v l -L{p,v)), (1.9) 

where Ext„ denotes the extremum of the expression to the right, with respect to the variables v. For 
the moment we assume that there is a unique such function. Then, we can go over to a new action 
principle which is entirely defined on V and given by 

I[q,p] = Jdt ct Vi -H{q,p). (1.10) 

The corresponding equations of motion are the Hamilton equations 

q l (t) = H (q(t), P (t)), kit) = (q(t), P m (1-H) 

They provide a set of first order differential equations in the standard form, which describe the time 
evolution of the state, and they are equivalent to the Euler Lagrange equations. To simplify the notation, 
we introduce an important structure on the phase space, the Poisson bracket. To motivate its definition, 
take a function F(q,p) on V and consider its time evolution, i.e. the value of F(t) = F(q(t),p(t)). Using 
([LTll) we find 



p , t) = dFdH _dHdF 
dq l dpi dq l dpi ' 



Note that q l and pj are just some special functions on V, so ( 1.11 ) can be expressed in this way as 
well. The right hand side is another function on V, which has to be evaluated at q(t),p(t), and this 
function is called the Poisson bracket of F and H. Generally, the Poisson bracket maps two phase space 
functions F and G onto a third function 

{F)G x = ^L^ ( i. 13) 

' dq 1 dpi dq l dpi ' 

It's basic properties are antisymmetry, linearity, the Leibnitz rule and the Jacobi identity: 

{F, G} = —{G, F}, {F + G, K} = {F, K} + {G, K}, 
{FG,K} = {F,K}G + F{G,K}, 

{{F, G},K} + {{K, F},G} + {{G, K},F} = 0. (1.14) 
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I will not prove this here as all this should be well known. We can replace the equations of motion by 
requiring that 

F = {F,H} (1.15) 



for every phase space function F. By inserting the special functions q l and pi, we recover ( 1,11|) , and 
due to pi the converse is true as well. If we are a little bit sloppy, we can also write 



x = {x,H}, (1.16) 

but here we have to interpret x as a collection of coordinates rather than a point on V, as the Poisson 
bracket takes functions on V as arguments and not points. Nevertheless it is a useful notation and 
should not lead to confusion. As a result, the classical dynamics of our system is now completely 
determined by two basic objects on the phase space: the Hamilton function and the Poisson bracket. 
But all this was based on the fact that the Hamiltonian can be defined by (|1.9| ). A system which can be 
treated in this way will be called unconstrained (we will see shortly why). In the context of the Dirac 
programme these are not of much interest, as everything what follows, at least at the classical level, will 
be rather trivial. Let us nevertheless stick to such a system for a while, just to introduce some more 
notions which will be useful below. 

A function on V generally denotes a quantity that (in principle) can be "observed" , like the positions 
of the particles, or the distances between them or whatever. Let us call them observables. With the 



Poisson bracket the set of observables becomes a Lie algebra. If F is an observable, then (1.15) tells 
us that there is another observable associated with the time derivative of F. An observable Q whose 
Poisson bracket with H vanishes is called a constant of motion or conserved charge, as whatever the 
state of the system is the value of Q(t) will be constant. The constant functions on V are the trivial 
conserved charges, and if Q and P are two conserved charges, then so are Q + P, QP, and also {Q, P}. 
The constants of motion form a subalgebra of the observables. The Hamiltonian is always a conserved 
charge, which is usually called the energy. 

As is also well known from classical mechanics, conserved charges are associated with symmetries 
via the Noether theorem. How does this show up here? As the conserved charges form a Lie algebra, 
we should expect that these are the infinitesimal generators of symmetry transformations. Let Q be 
some conserved charge, then the symmetry transformation generated by Q is 

5x = {x,Q}, (1.17) 

where the bracket is to be understood as in ( |1.16| ). By using the Jacobi identity and the fact that 
{Q,H} = 0, we see that with x(t), x(t) + 5x(t) is a solution as well, provided that we choose a time- 
independent Q. You can also check that the commutator of two infinitesimal symmetry transformations 
gives the transformation generated by the Poisson bracket of the two generating functions, so the algebra 
of conserved charges is in fact the Lie algebra associated with the symmetry group of the system. If 
we take H as a generator of a symmetry, then (1.17) formally coincides with ( 1.16[ ). This means that 



the symmetry transformation induced by H is the time translation, which is always a symmetry as we 
started from a time-independent Lagrangian. 

How can we now apply this "Hamiltonian method" to a more general Lagrangian? Suppose that 
the mass matrix in our example above becomes singular. Then there will in general be no extremum of 
the function appearing in (1.9). We have to find another way to eliminate the v variables from (|1 . 7|) . 



Let's have a look at the equation of motion for v, which reads 
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It does not contain any time derivatives, and of course it is just the condition we have to solve to find 
the extremum in (|1,9|). Now suppose it cannot be solved for a given state (q,p). This means that 
the Hamiltonian cannot be defined at this point of the phase space, but it also means that there is no 
solution to the equations of motion passing through this point in V. So perhaps we don't need the 
Hamiltonian there, as we will never get there. On the other hand, if ( |1.18|) can be solved, there is an 
extremum in ( |1.9D , and provided that the Lagrangian is not too ill-natured[], H(p, q) is well-defined. So 
we can only define the Hamiltonian on a subspace of V which is given by the image of the momentum 



map ( 1.18 ), as only for those states we can find the extremum required for H. Again assuming that 
the Lagrangian is not to weird, this image can be described in terms of a set of equations ip a {q,p) = 0, 
which we shall call the primary constraints. The subspace defined in this way is called the primary 
constraint surface (of course, the name suggests that there will be more constraints later on). 

Knowing that all possible solution entirely lie inside the primary constraint surface, and that H is 



well defined thereon, we can again replace the action principle by ( 1.10| ), but now with the restriction 



that the path has to lie inside the primary constraint surface. There is a general method to deal 
with such "constrained variational problems". We can make the explicit restriction implicit by adding 
Lagrange multiplier terms to the action and define 

I[q,p,u] = Jdt (Ipi - H (q,p) - u a ifj a (q,p), (1.19) 

where Hq has to coincide with H on the primary constraint surface, but is otherwise arbitrary (choosing 
a different extension of H corresponds to a transformation of the u variables). Defining the combination 

H(q,p) = H (q,p) + u a ^ a (q,p) (1.20) 

to be the Hamiltonian, the complete set of equations of motions become 

.,■ dH . dH , 

fc=0, ,- = ^, p,= g-. (1.21) 

They already look similar to ( |1.11| ), but there are some crucial differences. So, for example, the auxiliary 
variables u a are still in there, which seem to have no physical meaning. In fact, they parametrize the 
Hamiltonian, so we do not only have one Hamiltonian but a whole set of them. How do we know which 
is the right one, or are we free to choose it as we like? The u a are quite implicit in ( |1.21| ), so it is 
not obvious what their meaning is. Before coming to these questions in general, let us consider some 
examples. 

A singular mass matrix occurs whenever there are coordinates whose velocities do not appear in 
the Lagrangian. Such variables typically appear as Lagrange multipliers, and are in classical mechanics 
used to impose constraints onto a system, just as in our auxiliary action above. But now we will have 
such terms already in the original action. A nice example is the following. The configuration space is 
three dimensional, with coordinates qi,q2,Q3, and the Lagrangian is 

L = ±m(V + v 2 2 ) - k 3 {qi 2 + Q2 2 ~ r 2 ). (1.22) 



1 In general, there will be not only one value v where the function in (L.9) becomes extremal, but in almost all "physical" 
cases the set of these values is connected, so that the actual extremum is constant. This requires the Lagrangian to be 
in some sense "regular", e.g. at most quadratic in the velocities. In all our examples this will be the case, but there are 
funny systems where this is not true. Take, e.g., L — v 3 and consider piecewise smooth time evolutions. Then solve the 



extremum condition for the action (1.1) and try to reproduce these solutions with the Hamiltonian method! 
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It should not be too difficult to see that this describes a particle of mass m which moves on a circle of 
radius r in a two dimensional plane spanned by qi,q2, with (73 being the force necessary to make the 
particle stay on the circle. The general solution to the Euler Lagrange equations are 

qi(t) = r cos(ujt + ip), l2(t) = rsin(ujt + <p), ^(t) = mw 2 , (1-23) 

parametrized by oj,ip G K. The energy of the particle is ^mr 2 w 2 , which can be expressed in terms of 
the configurations variables as \r 2 q$. So we expect this to be the Hamiltonian. Let us just keep this in 
mind, and try to reproduce the solution by applying the Hamiltonian method. By differentiating the 
Lagrangian we find the following momentum map: 

dL dL dL , 

p 1 = —=mv 1 , P2 = —=mv 2 , p 3 = — = 0. (1.24) 

OV\ OV2 OV3 

The image of this map is obviously given by the subspace p% = of the phase space, so our primary 
constraint is 

Ml,P)=P3. (1-25) 

It is not difficult to find the Hamiltonian by solving ( |1.24| ) for v and insert the solution into piVi — L. 
Note that V3 remains completely arbitrary, but the value of the extremum does not depend on V3, as 
P3 = on the constraint surface. The total Hamiltonian analogous to (|1.20 ) then becomes 



H{q,p) = \m- 1 {p 1 2 + P 2 2 ) + \q ?J {qi 2 +q 2 2 -r 2 )+u^ (1.26) 

and the time evolution equations read 

dH , . dH 

qi = -^—=rn pi, pi = -— = -q s qi, 
dpi dqi 

dH . dH 

q2 = -^—=m p 2 , P2 = --^—=-qm, 

dp2 oq 2 

h = ^=u, p 3 = = -\{qi 2 + q 2 2 - r 2 ). (1.27) 

dp3 dq 3 2 

Let's try to integrate these equations, starting with a state on the constraint surface, i.e. with p 3 = 0. 
This doesn't work quite well, because the last equation immediately tells us that p% will not stay zero, 
unless qi 2 + q2 2 = r 2 . Hence, it is not enough to start with a state on the primary constraint surface, as 
this will in general evolve away from it. There are more restrictions on the initial condition than just 
P3 = 0. Its time derivative must vanish as well, which gives us another constraint 

4>2 = Uqi 2 +q2 2 -r 2 ). (1.28) 



Note that this is implicit in ( 1.27| ), so it is actually not a new equation of motion, but its puts a new 
restriction on the initial conditions, which have to be fulfilled before we start to integrate the time 
evolution. We shall call this a secondary constraint. In other words, it singles out jet another subspace 
of V. Of course, this is just the restriction that our particle moves on a circle with radius r. So we have 
to add this to the constraints. 

Are we finished now? Starting with a state (q,p) satisfying both constraints, will we stay on the 
constraint surface for all times? To find this out, all we have to do is to compute the time evolution of 
the phase space function ip2, which gives 

1P2 = {4>2,H} = mT 1 (p 1 qi +p 2 q2)- (1-29) 
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So we need another constraint 



^3 = PiQi + P2Q2- (1-30) 

More? 

V> 3 = {V> 3 , H} = -q 3 ( Ql 2 + q 2 2 ) + m~ X { Vl 2 + p 2 2 ). (1.31) 
Again, a new constraint, but we can simplify it a bit using ip 2 : 

ip4 = q 3 -m- 1 r- 2 ( Pl 2 +p 2 2 ) (1.32) 

Same procedure again: 

tp4 = {i>A,H} =u + 2m~ 1 r~ 2 q 3 (p 1 q 1 + p 2 q 2 ). (1-33) 

This is something new. It does not give a new constraint, but a restriction on u. As it is sufficient that 
the right hand side vanishes on the constraint surface (now including all the constrains collected so far), 
we only have to require u = on that surface, but arbitrary outside. So, our original idea that u might 
enter the Hamiltonian as a free parameter can not be confirmed. It is at least fixed on the constraint 
surface. 

But now we are through. We found all constraints, four all together, and once we start with a state 
solving them, we will stay on the constraint surface for all times. It is not difficult to see that the 
general solution to all constraints is 

qi=rcosip, q 2 = rsinip, q 3 = mui 2 , 

pi = — mujr sin if, p 2 = mujr cos ip, p 3 = 0, (1-34) 

with (p,ui two real parameters, and that the time evolution is given by replacing ip ip + ut, leading 
back to ( p.. 23 ). So, this was a quite instructive example to show how secondary constraints are derived, 



leading to a smaller constraint surface, which finally became two dimensional. We shall call this the 
constraint surface S. It can also be called the physical phase space, but sometimes this notion is used 
differently, so let us stick to the first one. In our example, the constraint surface looks like the phase 
space of a particle on a circle, with configuration variable <p and momentum mur. But the Hamiltonian 
is not simply the energy, which was found to be \r 2 q 3 . However, if we look at it more closely, we find 
that 

H = \r 2 q 3 + u a ip a , with u\ = 0, u 2 = q 3 , u 3 = 0, u^ = -\r 2 . (1.35) 

We see that, on the constraint surface, the Hamiltonian indeed gives the energy, but there are extra 
contributions proportional to the constraints. What is the reason for this? To derive the evolution 



equations in (1.27), we need the derivatives of H rather than H itself, so it is not sufficient two know 
H on the constraint surface, but we also need to know its first derivatives. We do not need to know its 
second derivatives or the value of H "far away" from the constraint surface. This means that we are 
free to add anything to H which vanishes and whose gradient vanishes on the constraint surface, i.e. 
which is proportional to the square of the constraints. But this again means that we are free to add 
anything that vanishes on the constraint surface to the u parameters. To express such a condition in a 
convenient way, we introduce the following notations. Two phase space functions F and G are defined 
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to be weakly equal, written as F ps G, if they coincide on the constraint surface. The conditions to be 
imposed on the u's can then be written as 

ui ps 0, u 2 ~ 53, ^3 ~ 0, Ui ps -^r 2 . (1.36) 

With weak equalities we have to be careful, because they are not compatible with the Poisson bracket. 
If we have F ps G, this does not imply {F, K} ps {G, K} in general. We are not allowed to use weak 
equalities inside the brackets. The result is that the Hamiltonian, as a phase space function, is only 
fixed up to terms of second order in the constraints, but this is all we need as only first derivatives of 
H on the constraint surface appear in the Hamilton equations. 

Let us consider jet another example, which in a sense will also turn out to be "unnecessarily com- 
plicated" because of "unconventional" variables, but it behaves quite differently from the former is 
explains another important issue concerning constrained systems. This time, the configuration space is 
two dimensional and the Lagrangian is 

L=\m(v l +v 2 f -V( qi +q 2 ). (1.37) 

It looks a bit silly, you will ask why shouldn't we take q = q\ + q 2 as a new variable, then the second 
will drop out and we are left with a single particle in one dimension. But this system will provide us 
with the simplest example of a gauge theory. The momenta are 

Pi = m(vi+v 2 ), p 2 = m(vi + v 2 ), (1.38) 

which gives the primary constraint 

ip=p 1 -p 2 . (1.39) 

Again, the Hamiltonian contains one free parameter and is easily evaluated to 

H(q, p) = \m- l p x 2 + V( qi +q 2 )+u ij>. (1.40) 

Note that we could also put p 2 into the first term, which gives the same function on the constraint surface, 
but this can be compensated by replacing u with u — ^m(pi +p 2 ). We also find that {ip,H} = 0, so 
there are no more constraints, nor is there any restriction on u. The evolution equations become 

q 1 =m~ 1 pi+u, q 2 = -u, i>\ = -V'(qi + q 2 ), p 2 = -V'(qi + q 2 ), (1.41) 

showing explicitly that the constraint is conserved. We see that, on the constraint surface, the Hamil- 
tonian gives the energy of a particle at q\ + q 2 with momentum p\ = p 2 = m(qi + q 2 ). The physical 
interpretation of this system is as follows. There is a particle in one dimension whose position is qi + q 2 , 
but it has some internal degree of freedom which can be taken to be the difference q\ — q 2 . The time 
evolution of this degree of freedom is completely arbitrary. Moreover, we see what the role of the 
constraint in this case is. It is the generator of a symmetry transformation, which is given by 

5qi = u{q 1 , tp} = u, 5q 2 = u{q 2 ,ip} = -u, 

5pi = u{pi, Y>} = 0, 5p 2 = u{p 2 , V>} = 0. (1-42) 

It leaves the "physical" quantities, the position q\ + q 2 and the momentum invariant, but changes 
the internal degree of freedom. The time evolution consists of a "physical" evolution of the position 
and momentum, plus a transformation generated by the constraint, with an arbitrarily time-dependent 
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parameter. Starting from the same initial conditions, the state can evolve into different final states, as 
we are free to choose u. If we want physics to be deterministic, we have to assume that this degree 
of freedom is unphysical, and cannot be measured. Otherwise we would not find classical physics to 
be deterministic. Two states, which can evolve out of a single state in this way must be declared to 
be physically equivalent, and a transformation that takes such states into each other is called a gauge 
transformation. It is important to note that all this does not come out because we didn't impose 
enough initial conditions. Perhaps, if we, in addition to the velocities, also fix the acceleration, then 
the evolution might become unique again. But this is not true, which follows from a theorem about 
differential equations, but which can also be seen quite directly. Assume we have a solution which was 
generated by some choice for the u parameter. Then, change this parameter, but only for times t > to- 
The new solution will be identical to the old one for t < to, but different thereafter. Hence, there is no 
chance to fix the time evolution by giving more initial conditions. 

As a result, we found that there are different kinds of constraints. In the first example there were no 
gauge degrees of freedom, and the Hamiltonian was unique (up to second order terms in the constraints). 
In the second example, it was not unique, but contained one free parameter (of course, here we are also 
free to add anything proportional to the square of the constraint, so we can make u any time-dependent 
phase space function). Can we somehow see this difference by expecting the constraints themselves? In 
fact, a general criterion can be found to distinguish these two classes of constraints, and to work this 
out let us come back to the general theory. 

What we had so far was that the equations of motion can be derived from a Hamiltonian which 
takes the form 

H(q,p) = H (q,p)+u a ^ a (q,p), (1.43) 

where ip a are the primary constraints derived from ( |l,18| ). The u were introduced as a set of free 
parameters, but to be a bit more general we can also allow them to be a set of phase space functions 
containing some free parameters (possibly we are forced to do so, see ( |1.35| )). If we want to use the 
general formula x = {x,H} to obtain the time evolution, we first have to find out which initial states 
are allowed. From the example we learned that this can be done be requiring that, beside the primary 
constraints, also their time derivatives vanish. It is enough for them to vanish weakly, because this 
already ensures that the state remains on the constraint surface, so we have to consider the conditions 

i> a = {V Q , H} « H } + U P{4> a , ^} « 0. (1.44) 

If the u's are phase space functions, we have to take this into account when we compute brackets with H. 
However, the relevant part of the result is always proportional to some constraint, so it vanishes weakly, 
and we made use of this in ( ^44|) . We can split this set of equations into a set of secondary constraints, 



and some equations restricting the it's, which effectively reduce the number of free parameters. Of 
course, the newly found constraints, also denoted by ip a with an extended range for a, have to be 
conserved under the time evolution as well. So we again have to evaluate ( |1.44| ) for them, possible 
getting more constraints and so on, until we either arrive at a contradiction or the new conditions 
are trivial. In the first case, there are no solutions to the equations of motion, the theory is rather 
uninteresting. In the second case, we end up with a full set of constraints 

MQ,P)=0, (1.45) 

which define the constraint surface S and the weak equality relation. The Hamiltonian takes the form 

H(q,p) = H (q,p) + u a (q,p) ^ a (q,p), (1.46) 
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where the u a may contain some free parameters, and are otherwise fixed weakly only. Note that there is 
a non-trivial step from (1.43) to ( |1.46| ), as now we are summing over all constraints, not just the primary 



ones. This is justified because we can now modify the Hamiltonian anywhere except at the constraint 
surface, which is in general smaller than the primary constraint surface. The it's corresponding to 
secondary constraints will however be fixed, because free parameters only appear in front of the primary 
constraints (if Hq is taken to be the same function as in ( |1.43| ), we simply have vP ~ for the secondary 
ones). The question that remains to be answered is whether there are any free parameters left after this 
procedure. Can we perhaps localize them without solving the equations ( |1.44 ) explicitly? We would 



then also know which constraints are those that generate gauge transformations and which are not. 

After finding all the constraints, we can once again have a look at the equations for the u a parameters. 
They were all of the form ( |1.44| ). If we now take all constraints and the Hamiltonian from ( 1.46Q , then 
the u's have to satisfy the following set of weak equations 

{iT Q ,^ a } + ^{Va,#}~0, (1.47) 

which will in general be over-complete as part of it has already been solved to derive the secondary 
constraints, but nevertheless it forms a full set of equations for the u's. Actually, only the u's corre- 
sponding to the primary constraints are entering these equations as variables, but we can take them 
as equations for the secondary u's as well for the moment, and look for the general solution (we know 
that there is a solution with u@ ~ for the secondary u's, but there might be more and below we will 
see that these turn out to be useful as well). Whether or not there are free parameters in the u's now 
crucially depends on the antisymmetric square matrix 

A aj3 = {^ a ,^}. (1-48) 

If it is invertible, we can solve ( |1.47| ), giving us a (weakly) unique solution for all u's, and thus a unique 
Hamiltonian and a unique time evolution. By computing the relevant 4x4 matrix you can check that 
this is the case for the particle on the circle. But suppose the matrix is not invertible. Then there 
are some zero eigenvectors. As we are free to make linear transformations on the constraints, we can 
assume that there is a subset of constraints <j) a , whose brackets with all constraints weakly vanishes, i.e. 

{0«,^«}«O. (1.49) 

It is obvious that then ( |1.47| ) does not impose any restriction on the corresponding u a . A phase space 
function with the property that it has weakly vanishing Poisson brackets with all constraints, is called 
a first class function (as an example, the Hamiltonian is always first class). So the <p a form the set 
of first class constraints. On the other hand, if we denote the remaining constraints by x m , then the 
sub-matrix A mn = {Xm,Xn} is invertible (if it was not, then there must be a linear combination of the 
Xm which is first class). As a result, the u m corresponding to these constraints are (weakly) fixed. Let's 
call the Xm second class constraints. The whole set of constraints is now split into first and second class 
constraints, where the first class constraints form a linear subset, and the second class constraints are 
fixed up to linear combinations of first class constraints only.0 

From the example we already learned that gauge transformations are generated in the same way as 
the symmetry transformations considered in ( 1.17| ), with the conserved charge replaced by the constraint. 



2 Actually this split can only be made separately at each point on the constraint surface. However, if we again assume 
that the Lagrangian is somehow regular, we can do the split globally. This requires, e.g., that the (weak) rank of the matrix 
A a p is constant, which will be the case in all our examples. There are, however, some interesting theories, especially gravity 
theories with singular metrics, which do not have this property. 
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They show up in the Hamiltonian as a term proportional to the constraint, with a free parameter in 
front. If F is some phase space function, then 



5F = u a {F,cj )a } *{F,u a <j> a } (1.50) 

gives the transformation of F generated by the special linear combination u a <j) a . For our general Hamil- 
tonian, the constraints appearing in this way are exactly the primary first class constraints. So we 
conclude that these are the generators of gauge transformations. However, if we look at the Hamilto- 
nian and the constraints in their final form, the distinction between primary and secondary constrains 
doesn't show up anywhere except in the fact that originally only the primary ones had free u parameters 
in front. But to extract the first class constraints we had to find the zero eigenvectors of the matrix 
A a p, and this might involve a mixing of primary and secondary constraints, so it is actually not clear 
whether one can define primary first class constraints at all. And it would be much nicer if we could 
forget about the distinction between primary and secondary constraints, as it only shows up in the 
derivation of the constraints and not in their final form (in contrast to the distinction between first 
and second class constraints). So let us prove that the secondary first class constraints generate gauge 
transformations as well. 

For this proof, let as for the moment assume that there are no second class constraints, then we have 
to prove that all constraints generate gauge transformations. Otherwise, the proof goes through in the 
same way, it just becomes technically more complicated.^] We know that every secondary constraint ip 
can be written linear combination of the form 

if> = c a ({^J a , H } + u p {Va, #}), (1.51) 

where the sum runs over only those q's that correspond to constraints discovered "earlier" than tp 
(including the primary ones). It was exactly that way the secondary constraints were derived, namely 
be eliminating the u's from ( |l.44| ). If we assume that all constraints appearing on the right hand side of 
(1.51) are generators of gauge transformations, then so is ip, for the following reason. If two constraints 



generate a gauge transformations, then their Poisson bracket does as well, because the transformation 
generated is just the commutator of both, and any combination of gauge transformation is gain a gauge 
transformations. Moreover, the commutator of a gauge transformation with the time evolution is also 
a gauge transformation. Whether we first make a gauge transformation and then evolve our system, 
or the other way around, that does not make any difference. Hence, the bracket of a constraint with 
the Hamiltonian is again a generator of a gauge transformation. It now follows by induction that all 
constraints generate gauge transformations, as we know that the primary ones do, and all others are 



successively given by (|1.51[) . 

Now comes a crucial step. If we are only interested in the physical state of our system, we are free 
to make gauge transformations at any time, so we are free to add any linear combinations of first class 
constraints to H. The parameters in front of these constraints can be any phase space function which 
can also depend on time. So, finally we end up with the Hamiltonian taking the form 

H(q,p,t) = H {q,p) +u a (q,p,t)(j) a (<l,P)- (1-52) 

Here, the terms containing the second class constraints have been included into Hq as they do not 
contain any free parameters. I should emphasize, that this step is entirely motivated by physics and 
takes us away from the purely mathematical question of finding time evolutions extremizing the action. 
This Hamiltonian no longer generates solutions to the original Euler Lagrange equation. But physically 



Alternatively, one can use the Dirac brackets to be introduced below and do the proof formally is given here. 
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a time evolution obtained by integrating the Hamilton equations starting from a state on the constraint 
surface will always be gauge-equivalent to some solutions of the Euler Lagrange equations. Finally, the 
complete set of equations of motions split into the first and second class constraints 



i(q,p) = 0, Xm{q,p) = 0, (1.53) 



and the Hamilton equations 



.,• dH . dH , 

* = P ' = W- (L54) 

Using the bracket notation, they read 

x = {x,H}n{x,H } + u a {x,<t> a }, (1.55) 

showing how the time evolution is split into a "physical" evolution and a gauge transformation. Notice, 
however, that here we are again using the somewhat sloppy notation, taking the bracket with the point 
x, which should actually be a phase space function. 

This in a way completes the classical part of the Dirac programme, but let us consider the second 
class constraints a bit further. We got a quite nice understanding of the physical meaning of the first 
class constraints, but yet we have no idea whether there is something similar for the second class ones. 
From our particle moving on a circle we know that these can emerge from exterior constraints imposed 
on the configuration variables in the Lagrangian. Maybe they are just a kind of "trivial" restrictions 
which simply reduce the actual "size" of the phase space but are not very interesting otherwise. We 
could equally well solve the particle on the circle by choosing only one (periodic) configuration variable 
ip and we would end up with the same solutions. But we won't end up with the same Hamiltonian. 
This is because in (1.35) there are contributions proportional to the second class constraints which will 



never show up in the simplified model. So, it seems to be non-trivial to get rid of the extra phase space 
variables after applying the Hamiltonian method. If we simply change coordinates such that two of 
them are the angle ip and some conjugate momentum, the second class constraints do not drop out 
form the Hamiltonian, because they will still be necessary to provide the correct equations of motion. 
One should expect that (p and ir = mujr are the relevant phase space coordinates that also appear in 
the simplified model as conjugate variables. But observe that, depending on how they are defined as 
functions of our phase space variables, their Poisson bracket will in general not (and not even weakly) 

be{<M} = l-0 

Nevertheless there is a way to get rid of the second class constraints in the Hamiltonian. The trick 
is to modify the Poisson bracket such that all constraints become first class. If we fix a set of second 
class constraints Xm, it is possible to define a modified bracket, the Dirac bracket, which still gives all 
the correct equations of motion, but for them we will have 

{F,Xm}*~0 (1-56) 

for every phase space function F. As a result, terms proportional to them appearing in the Hamiltonian 
will not contribute to the evolution equations (not even generating a gauge transformation) , and we can 
drop the corresponding terms (actually: we are free to add terms proportional to the Xm to any function 
without weakly changing their brackets, especially to the Hamiltonian). To see how this can work, let 
us consider the most simple set of second class constraints. Suppose we have a phase space spanned by 

4 Take 4> = arctan(— px/p^) and tv 2 = p\ 2 +P2 2 , then their bracket even vanishes. 
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q l and pi, i = 1, . . . , n, and there are two constraints xi = Q 1 an d X2 = Pi- These are obviously second 
class. They just tell us that the first degree of freedom is of no interest at all, we should have left it 
apart from the very beginning. What would be the Poisson bracket on the phase space if we started 
without the first degree of freedom? Clearly, it would be 

s F G \ -y ^L^L _^_^L n 5 7 ) 

* ^ 

We just have to drop i = 1 from the summation. We can express this in terms of the full Poisson bracket 
and the constraints only. It reads 

{F, G}* = {F, G} + {F, xi}{ X 2, G} - {F, X2 }{xi, G}. (1.58) 

You can easily check that this is exactly the same as above. In a sense, the two constraints here 
are "conjugate" phase space variables, and what the last equation does is to subtract that part of the 
Poisson bracket that comes from this pair of variables. You can also check that the new bracket vanishes 
whenever one of its arguments is one of the constrains, and as we shall see it has all the properties of 
the Poisson bracket given in ( p.. 14 ). It seems that what we have to do is to find a transformation of 



variables such that the second class constraints become pairwise conjugate. This needs their number to 
be even, will this always be the case? Remember that we had a matrix 

Arm = {Xrn,Xn}, (1-59) 

and the second class constraints were defined such that this is non-singular. But an antisymmetric 
matrix must have even rank, so the number of second class constraints will always be even. It is not 
clear how this problem can be solved in general, because it is not always possible to find these pairs of 
variables. However, we actually only need to solve a "linearized version" of the problem. That is the 
following. If F is some phase space function, then we want to have 

{F, Xm }* ~0, (1.60) 

for all second class constraints. We make an ansatz 

{F, (?}* = {F, G} - {F, Xm } A mn { X n, G}, (1.61) 



which is similar to what we had above. You can now simply solve the equation ( |1.60 ) to find that A mn 
has to be the inverse of A mn , i.e. 

A mn A np = A mn {xn, X P } = S m p . (1.62) 



Actually we only need this equation to be satisfied weakly in order to get ( [1.60 ). But if we can solve 



it "strongly", then {F,Xm}* would also vanish strongly, which might be quite useful as we can then 
use the second class constraints anywhere inside the brackets, however nested they are. In most cases 
this can be established, but not in general, as the matrix A mn might become singular at some point 
outside the constraints surface. However, its determinant is non-zero on the surface, so there is a (finite) 
neighbourhood were it can be inverted to give A mn . Then, (1.60) holds on a whole neighbourhood of 



S, and beside {F, Xm}* all its derivatives will vanish, and we can use Xm = as a strong equality inside 
brackets, as long as we finally evaluated the result on the constraint surface. So, with the Dirac bracket 
we don't have to worry about the second class constraints any more, they just become strong identities 
for all practical purposes. 



13 



But yet we haven't proved that they do the job, i.e. that they really provide the same equations of 
motions as the Poisson bracket. This is not very difficult, as 



{F, H}» = {F, H} - {F, Xm } A mn { X n, H} « {F, H} = F, (1.63) 

for any phase space function F, because the Poisson bracket of the Hamiltonian with any constraint 
vanishes weakly. There remain some other things to be shown, namely that the new bracket has the 



same properties ( 1.14 ) as the old one. Except for the Jacobi identity this is immediately obvious. With 
the formalism derived so far, the only possibility to prove the Jacobi identity is explicit calculation, 
which is rather involved. As always in cases like this, a very simple and almost trivial proof exist if one 
uses a different formalism, which makes more use of the geometric structure of the phase space. So, let 
us skip the rather tedious proof for the Jacobi identity here. 

After introducing the Dirac bracket, we are now also free to add anything proportional to second 
class constraints to the Hamiltonian, without affecting the equations of motion. Hence, we can take it 
to be 

H(q,p,t) = H (q,p) + u a (q,p, t) MQ,P) + v m (v,P,t) Xmfap), (1-64) 

but we can equally well set v m = 0. However, the freedom to choose v means that the "allowed" 
Hamiltonians are now given by all phase space functions weakly equal to Hq. Hence, we only need to 
know the energy, i.e. H on the constraint surface. Its extension can be chosen arbitrarily. The Dirac 
bracket is designed such that it "picks up" the derivatives of H tangent to the surface and in the "normal" 
directions corresponding the first class constraints only, thereby generating the undeterministic gauge 
transformtions, but ignoring the derivatives in the direction of the second class constraints. 

Regarding the "classical" Dirac programme, we have more or less reached the goal. Let us sumarize 
what we have found. The constraints single out a subspace of the phase space. A point in that subspace 
corresponds to a physical state and can be inserted into the Hamilton equations as initial condition. 
The first class constraints generate gauge transformations. Moreover, they provide a representation of 
the gauge algebra, which is the Lie algebra of the gauge group associated with the system. The Lie 
bracket is of course given by the Poisson bracket: 

{4>a,4>b} = fab C <t>c- (1-65) 

We cannot expect that the structure functions f a b c are constants, in general they will be some phase 
space function. We can only try to redefine the constraints such that the structure functions become 
constant. If that doesn't work, it will be of some importance for the quantum theory. But why is the 
Poisson bracket of two first class constraints again a linear combination of first class constraints? We 
know that it vanishes weakly, as this was the definition of first class phase space functions, so it is 
certainly a combination of constraints. So we have to prove that the bracket of two first class functions 
F and G again gives a first class function. In fact, the bracket of {F, G} with some constraint gives 

{{F, G},ip} = {{F, ip},G} — {{G, ip},F}. (1.66) 

Now F is first class, so {F, ^} can be written as is a linear combination of constraints, whose bracket 
with G vanishes weakly, and vice verse, so the whole right hand side vanishes weakly. As a result, only 



first class constraints appear on the right hand side of (|1.65| ). 

For an unconstrained system we introduced observables as phase space functions that correspond 
to measurable quantities. What is the analogy for constrained systems? First of all, a measurable 
quantity should be a function on the constraint surface only, as a state outside will never be realized. 
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We can nevertheless deal with functions on the whole phase space as representations of observables. We 
simply define two function representing the same observable if they are weakly equal. But if there are 
gauge degrees of freedom, we had to assume that certain quantities are not measurable, as they have an 
indeterministic time evolution. Only those quantities are measurable, which are invariant under gauge 
transformations. As gauge transformations are generated by the first class constraints, a function is 
invariant if its brackets with the first class constraints vanish weakly. Using the Dirac brackets, we can 
equally well say that the brackets with all constraints have to vanish weakly: 

{O,0 a }«O & {O,V>a}*~0. (1.67) 

We can obviously add and multiply observables, but what about taking brackets? We have to assure that 
the resulting bracket is independent of the representations chosen for the observables. So, let F\ ~ F2 
be two representations of the same observable, and G another observable, then F\ — F2 — u a ip a is 
some linear combination of constraints. From ( |1.67 ) we conclude that its Dirac bracket with G vanishes 



weakly, so that 

{F 1 ,G}^{F 2 ,G},. (1.68) 

The Dirac bracket therefore gives a unique operation on observables, which are equivalence classes of 
phase space functions. As for the unconstrained systems, the observables form a Lie algebra, but the 
Lie bracket is given by the Dirac bracket and not by the Poisson bracket. We can also define conserved 
charges as those observables which are constant in time. In addition to ( |1.67 ), a conserved charge Q 
has to satisfy 

{Q,#o}*~0. (1.69) 

It generates symmetry transformations 

SF = {F,Q}„ (1.70) 

which map solutions onto new solutions, as can be shown in the same way as for unconstrained systems. 
Actually only the weak version of ( |1.70 ) is relevant, as solutions always live on the constraint surface. 



It is however important that the Dirac bracket appears: take F to be a second class constraint, then we 
must clearly have 5F ~ 0, which is only guaranteed by the Dirac bracket. Moreover, we see that if F 
represents an observable and Q a conserved charge, the 5F is again an observable, which is independent 
of the chosen representation. So the symmetry transformation are well defined on the observables. 

If we just use the definition, the constraints themselves also become conserved charges: the brackets 
(1.67) and (|1.69| ) both vanish weakly. And they are indeed generators of symmetries, which are however 



"unphysical" , i.e. they do not change the physical state (whereas in general symmetry transformation 
do change the physical state). But remember that we defined observables to be equivalence classes of 
phase space functions with respect to ~. So all constraints and all combinations thereof just represent 
the trivial conserved charge Q ~ 0. Moreover, if we choose different, but weakly equal representations 
for some conserved charge, then they will generate different symmetry transformation, but the difference 
is just a gauge transformation. 

Quite generally, we can define a reduced phase space as the quotient of the constraint surface S 
with respect to gauge transformations. Then, an observable simple becomes a function on the reduced 
phase space. There is also a bracket on that space, as the Dirac bracket of two observables is again 
an observable. If we go over to the reduced phase space, we can forget about all the constraints and 
gauge transformations, it will just look like the phase space of an unconstrained system. But in general 
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it will not be a cotangent bundle of some configuration space, and possibly it will not even be a proper 
manifold, so it is much harder to deal with, especially when we try to quantize it. 

What are the observables and conserved charges in our examples? For the particle on the circle, we 
found that the constraint surface was two dimensional, so there are two independent observables and 
every observable is a function thereof. They can be taken to be <j> and it = mujr as they appear in (|1.34| ). 
There are many ways to express them in terms of phase space function, for (f> we can take arctan(— P1/P2) 
or arctan(g2/(/i) (or anything weakly equal). If we take ir 2 = p\ -\-p\ (some signs have to be fixed for the 
square root as well as for the arctan, but it should be clear how), then the Poisson bracket of the two 
observables is obviously not well defined, as the value depends on which representation we choose. But 
the Dirac bracket is, as you can check, always {(f), 7r}* ~ 1. The momentum ir is a conserved charge: 
the generator of rotations. 

For the two-dimensional gauge theory, the independent phase space functions p\, P2 and q\ + q2 
have vanishing brackets with ip. However, p\ and P2 are weakly equal, so they represent the same 
observable and we are left with two independent observables, P p\ « P2 and Q « q\ + q%- Their 
bracket is {Q,P} ~ 1, so they are position and momentum of a particle in one dimension. The 
reduced phase space is just that of a particle in one dimension, and the Hamiltonian can be written as 
H = \mT x P 2 + V{Q). 

2 Quantization 

How do we quantize an unconstrained system? We usually take the configuration variables and momenta 
as the basic observables and define a representation of them as quantum operators on some Hilbert space 
7i. In quantum theory, a state is represented by a vector in this state space. The operators have to 
be chosen such that the commutator of two operators is given by the operator corresponding to the 
Poisson bracket of the observables. What we need is a map ~, that assigns an operator to each phase 
space function, such that 

[F,G]=ih{F~G}. (2.1) 

And of course we want to have some other properties: the map should be linear, the constant observables 
should be represented by scalar multiplication, and we would also like to have something like FG = FG. 
This can in general not be established completely. Simply because if F and G have non-vanishing 
Poisson bracket, then the corresponding operators should not commute, and therefore we cannot have 
FG = FG = GF = GF. All we can do is to select some subset of observables, usually taken to be a 
set of canonical variables like the g's and p's, and for them we require (|2,1|). If we then give a general 
rule how to map phase space functions onto operators, we find that ( |2.1[ ) will only hold "modulo the 
operator ordering" on the right hand side, which means that 

[F, G] = \%{F\G} + 0(h 2 ). (2.2) 

There is always a standard representation if the phase space is the cotangent space of some configuration 
space Q. We can take the state space to be the set of square integrable complex wave functions ^ on 
Q and define 

F*(q)=F(q)*(q), ft = -ih || (q), (2.3) 

where F is a function of q alone. To get the operators for other functions, we have to expand them 
in a power series in p to obtain the representation in terms of a power series in d/dq. So far this is 
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standard quantum mechanics, but note then this is just a special representation, and depending on the 
actual phase space given there might be others which are more suitable. 

There is one additional restriction we have to impose on the operator representation, which ensures 
that our state space is not too big. The representation should be irreducible, which means that the 
state space cannot be decomposed into subspaces which are invariant under all observables. Then, 
each subspace alone would already form a complete state space. Using the Dirac notation \^f) for an 
element of the state space, and (*$>\ for its dual, we can briefly formulate standard quantum physics in 
the Schrodinger picture as follows. Given a state |^(to)), then it evolves in time as 

ift^|¥(f)> = £| *(*)>■ (2.4) 

Statements about measurements and physical properties of a state are made in form of expectation 
values of observables 

(F) = (V\F\W). (2.5) 

As we want expectation values corresponding to real phase space functions to be real, the scalar product 
has to be such that real observables are represented by Hermitian operators. Note that this is not a 
restriction on the operator representation ~ or on the scalar product on H alone, but it states a relation 
between them. This will be quite important for constrained systems. In the standard representation 
(|2.3|), the unique scalar product is 

<$|*) =Jdq $*(qMq), (2.6) 

where dq is the measure that is invariant under "translations" generated by the pi operators. Of course, 
this only exists if the configuration space in finite dimensional. If this is not the case, we have to use a 
different definition of the scalar product, as we shall see in our examples in the next section. 

We also have theorems like Ehrenfest's, which essentially tells us that expectation values behave 
almost like the classical phase space function. For example, we have 

( F ) = (* I {FTH} I *> = 4<* I [F, H]\*) + 0{h) = %:(F) + 0(H). (2.7) 

in ot 

We can define all this for a constrained system as well. We can even take the standard representation 
(|2.3| ) and what we get is a state space, let us call it V, and a representation of the phase space functions 
as linear operators. We also get a time evolution of the state, generated by the Hamiltonian. If there 
are gauge degrees of freedom, it will contain free parameters, and they will also appear in the quantum 
theory. The time evolution will not be unique. So far this seems to be quite straightforward. But let 
us have a look at Ehrenfest's theorem. If we want that the quantum expectation values behave like the 
classical phase space functions, this should especially hold for the constraints. So, we would like to have 

<* | ^ a | *) = 0. (2.8) 

We can take these quantum constraints as a restriction to be imposed on the state \^S>). In analogy to 
the classical counterparts we can call them physical states. The problem with this restriction is that it 
does not single out a linear subspace of V, as it is a quadratic equation in vf. Moreover, what about 
expectation values of the form 

(VP | $ a $p | \fr) ? (2.9) 
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If we want Ehrenfest's theorem to hold, these have to vanish as well, and the same must hold for 
every function of the constraints that classically vanish, at least to the leading order in fi. If all these 
expectation values vanish, then we have an eigenstate, and the condition to be imposed on a physical 
state becomes 

$a I *} = 0. (2.10) 

This provides a linear subspace, which is called the physical state space. At first sight, it is not clear that 
an equation like this can actually be solved, because it states an eigenvalue equation for |^). Perhaps 
the operator in front doesn't have zero as an eigenvalue. If this is the case, there is no other way out than 
to choose a different operator representation. However, things are not as restrictive as they might seem. 
The representation space V is not required to be a Hilbert space: there is no "finite norm" restriction on 
|\E'), nor is the constraint operator Hermitian or whatsoever. Using the standard representation, ( 2.10j ) 



becomes a set of differential equations, but without any boundary conditions, which is in general easy 
to solve. 

If we take our simple gauge system as an example, the standard representation is given by ^(qi, #2)1 
and pi = —ihd/dqi for i = 1,2. The quantum constraint is 

(2.11) 

oqi 012 

Clearly, the general solution is = ^(gi+q^)- This shows two crucial properties of constrained systems. 
The first is that such a function can never be square integrable on K 2 . Generally, for constrained systems 
the standard product will not give a suitable scalar product on the physical states. A second property 
of the wave function its gauge invariance. Remember that in the classical theory we had to perform 
two steps to get a gauge-invariant object: we had to solve the constraints, and then we had to consider 
equivalence classes of states generated by the first class constraints. Here, it seems that the first step 
already leads to a gauge invariant object. This also holds quite generally, but let us first consider the 
conditions fl2,10|) in more detail. 

There might be another problem when we try to solve them. It is not obvious that the equations 
are compatible. They imply 

$a,fe]|*> = 0, (2.12) 

and this should not impose any new restriction on the state. For the classical constraints we know 
that their Poisson bracket is again a linear combinations of constraints, if at least one of them is a first 
class constraint. But what happens if we take two second class constraints in ( 2.12p ? In the worst case 



the commutator will be the identity operator (as in the trivial example xi — Q \ X2 = Pi), and the 
condition gives \ip) = 0. But it should now be clear how to proceed. At the classical level, we solved the 
problems concerning second class constraints by introducing the Dirac bracket. We can do the same 
here. Instead of defining the operators to be a representation of the Poisson bracket algebra, we use 
the Dirac brackets everywhere. This should work equally well, as they have the same basic properties 
as the Poisson brackets. Except that now we cannot choose the standard representation any more. 

Hence, if we have second class constraints, they have to be implemented in the quantum theory 
from the very beginning. We can impose them as exact operator identities, as the Dirac brackets were 
made such that the second class constraints behave like identities. The physical state condition is then 
trivially satisfied for them, we only have to impose it for the first class constraints. For the particle on 
the circle this means that we can choose the state space to be the set of complex wave functions on the 
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circle. The operators are then given by ( 1.34 ), with oj replaced by a suitable differential operator with 
respect to ip. This is of course where we also would have ended up if we had started form the simpler 
Lagrangian depending on ip only, describing a free particle on a circle. So, as at the classical level, 
the second class constraints are not of much interest. In a sense, they result from choosing unsuitable 
variables. In the following we will therefore assume that there are no second class constraints present. 
Otherwise just replace the Poisson brackets by Dirac brackets everywhere. 



If all constraints are first class, then we can expect ( 2.12 ) to be a linear combination of the equations 



( 2.10 ), provided that the classical algebra of the first class constraints ( 1.65 ) is preserved: 



ihf, 



ab 



(2.13) 



The structure functions f ao c in the classical algebra are in general phase space functions, so in the 
quantum theory they are represented by operators. These do not necessarily commute with the con- 
straints. So it is essential that they appear to the left. Otherwise ( [2.12 ) gives an equation for physical 
states that has no classical counterpart, and it might even give a contradiction as we saw for the second 
class constraints. The "structure operators" j a b c need, however, not be the operators associated with 
the classical structure functions, it is only necessary that they exist and appear to the left. If it is 
not possible to give an operator representation such that (]2.13| ) holds, we cannot impose ( j2.10|) , or if 
we impose it we would get a too small physical state space. This is a non-trivial problem in Dirac's 
canonical quantization. If the constraints can be arranged in this way, then ( gl3|) gives the quantum 
representation of the gauge algebra. In typical gauge field theories, where the structure functions are 
numbers and the gauge transformations act linear on the fields, this can always be achieved. If the 
gauge group is however more involved, e.g. the diffeomorphism group in gravity or string theory, it is 
not clear that we can always find a representation satisfying (2.13). There may however be a way out 
as we shall see in a moment. 

Let us first consider the time evolution. Using the decomposition ( [1.52 ) for the Hamiltonian, we 
have 



m 



dt 



(2.14) 



Actually the u a should become operators as well, as we were free to insert any phase space function for 
them in the classical theory. However, for physical states the last term vanishes anyway, and we are 
left with the first one only. To get a consistent theory, this must not take us away from the physical 
states. As the state space is a linear space, we can simply check this by acting on the time derivative 
of I if) with a constraint and see whether the result vanishes. 



, flnl I 9) 



H, 



9a 



(2.15) 



Clearly, the last term vanishes as |\&) is a physical state. At the classical level, we know that H is first 
class, so its bracket with any constraint vanishes weakly. Here, we get an extra condition, namely that 
this property is preserved in the quantum theory: 



[$ a ,H Q ] = ihg a h 
with the coefficients g a b again appearing to the left. 



(2.16) 



There is something else we can learn from ( 2.14 ), and we saw this already for our simple example. 
In the classical theory, the last term was the one that generated gauge transformations on the state. 
Here we find that it simply vanishes when it acts on a physical state. Physical states are already gauge 
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invariant quantities. Imposing (|2.10| ) somehow combines the two steps we needed at the classical level. 
There, we had to impose the constraints and then identify those states that are related by a gauge 
transformation. In a sense, these two operations are conjugate to each other. To illustrate this, let 
us try to interchange them. At the classical level, we could equally well first take equivalence classes 
and then restrict to the constraint surface. This would just result in defining some classes outside the 
constraint surface as well, which will afterwards be thrown away anyway. 

In quantum theory, starting from the representation space V, we can consider two states as gauge 
equivalent if they can be deformed into each other by a transformation generated by the constraints. 
As the states live in a vector space, we can simply declare two states to be equivalent if their difference 
lies in the image of the constraint operators: 

^ - |$) = $ a \n a ) (2.17) 

for some vectors \£l a ). Then, we can define the physical state space as the quotient space of V with 
respect to this equivalence relation. Intuitively, it should be clear that this will result in the same 
physical state space as the other procedure. Building the equivalence classes is effectively the same as 
imposing the constraints on the dual vector space[] 

(* | $a = 0. (2.18) 

In a sense, we are interchanging the kernel with the image of the constraint operators. The problems 
occurring are the same in both cases. If we cannot find a representation where the first class constraints 
form a closed algebra, the equivalence classes will become too big and therefore the physical state space 
too small. The only difference is that now we have to require the "dual" of ( p. 13 ), i.e. the structure 
functions have to appear to the right as 

=ih$ c f ab c . (2.19) 

It seems that the "operator ordering problem" , i.e. the problem of finding a representation that preserves 
the classical constraint algebra, cannot be solved in this way either. We just have to reverse all the 
operator ordering to get from one picture to the other. However, in many cases where this problem 
actually occurs, there is a way to solve it by combining the two procedures. Assume that it is possible 
to split the complete set of (first class) constraints into two subsets, denoted by (ft a and <ft* a . In many 
cases0 these turn out to be complex conjugate sets of constraints, but as long as we do not have a 
Hilbert space this is of no significance here. If they can be chosen such that 

Ik, fa] = lh lab fa, [fc, ^1] = m 4>*c f*ab C , (2.20) 



then we can apply a mixture of the two methods. First, we impose ( 2.10j ), but only for one half of the 



constraints, which is consistent by the first equation. Then, we take equivalence classes with respect to 
the second set. These equivalence classes will in general not be "tangent" to the result of the first step. 
In other words, adding something of the form cft* a \£l a ) to a physical state may result in an unphysical 
one. But that does not cause any problems, it simply means that the equivalence classes are smaller. 
Here is a simple example. Consider two first class constraints (ft and (ft*, such that their classical Poisson 

5 A dual vector can be considered as a proje ctor onto some one- dimen sional subspace of the state space, and if we 



consider only those projections satisfying (2.18), then the states in (2.17) effectively become equal because there is no 
projection that can distinguish them. 

An important exa mple w here t his t rick works is string theory, where the constraints form a Virasoro algebra, whose 



(i 



central charge violates (2.13), but (2.20) holds after splitting them into "positive" and "negative" frequency operators. 
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bracket vanishes but [4>, 4>*] oc Ti 2 . Clearly, ( ]2.13|) does not hold, but ( 2.20 ). Solving 0|vl/) = gives the 



physical states. Now, declare two such states as equivalent, if their difference is 4>*\fl) for some physical 
state You can easily see that this never happens except for |0) = 0. The equivalence classes are 
trivial and we get a well defined physical state space. 



If we use this procedure, we do not have the strong condition ( 2.10 ) any more, but Ehrenfest's 
theorem still holds for the constraints. In ( |2.8D, t he constraints will either annihilate the state to the 
left or to the right. For the higher order terms (|2.9| ), we probably have to "normal order" the constraints 
to make them act on the correct side, but this produces corrections of order 0{K) only, and these cannot 
be avoided anyway in Ehrenfest's theorem. So, even if we cannot define a proper algebra of quantum 
constraints, there are ways to overcome this problem. For simplicity however, let us in the following 
assume that we succeeded in finding a representation such that ( p. 13 ) holds, and that there is a well 
defined physical phase space, denoted by Ti. 

It is not a Hilbert space yet, so we have to define a scalar product. We want that the real observables 
are represented as Hermitian operators. In the classical theory, observables were found to be those 
phase space functions whose brackets with the constraints vanish weakly. Let us introduce a similar 
"weak equality" in quantum physics. We call two operators weakly equal if there difference is a linear 
combination of constraints, with the coefficients appearing to the left J^] 

F^G ^ F-G = u a ^ a . (2.21) 

Remember that the second class constraints have been realized as exact operator equalities, so they 
don't show up here. As an example, the condition ( 2.1(j ) can now be expressed as 



[#o,0a]~O. (2.22) 

A quantum observable O is defined to be an operator with exactly this property: it has to commute 
weakly with all constraints 

[O,0a]~O. (2.23) 

It is not immediately obvious the every classical observable can be related to a quantum observable, 
again because of the ordering problems which are present every time we are replacing a bracket by 
a commutator. But let us assume that we can find a representation such that the operator for any 
classical observable becomes a quantum observable. The fact that the commutators of observables with 
constraints vanish weakly tells us that the observable maps physical states onto physical states, as 

a 6|*) = 6<£ a |*)-[6,0 a ]|*) =0. (2.24) 

Here we used that every weakly vanishing operator, acting on a physical state, gives zero. The observ- 
ables are exactly those operators that act on the physical state space Ti. When restricting the operators 
on Ti., the weak equality becomes the identity. As for the classical theory, observables are actually 
equivalence classes of operators modulo the weak equality. If we want all this to be in analogy with the 
unconstrained system, Ti should become the physical Hilbert space, so that we can compute expectation 
values of observables using (|2.5|). As only observables represent physical quantities, there is no need to 
have expectation values for any other operator, and it is therefore sufficient to have a scalar product 
only on the physical state space. We saw already that sometimes it is even impossible to extend the 
product to the original representation space V. 

7 If we use the "mixed scheme" introduced above, the ^-constraints must appear to the left and the others to the right, 
and similar modification are to be made in what follows. 
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This final step of the Dirac programme turns out to be a rather sophisticated. There is no general 
recipe how to do it. In most cases it is pretty obvious how the product has to look, but sometimes it 
is not at all clear. All we have to fix it is that we require the real observables to become Hermitian 
operators, or equivalently that the complex conjugate of a classical observable is represented by the 
Hermitian conjugate of the quantum observable. This will ensure that Ehrenfest's theorem also holds 
for the conjugation: 

(F)* = {F*) O F* = F*. (2.25) 

As for unconstrained systems, the last equation gives a relation between the representation map ~ and 
the scalar product which defines the conjugation map K But is also imposes some extra restrictions on 
~ alone. If we are unlucky, all we did so far was in vain, because these are not realized. Suppose, for 
example, that there are three real classical observables satisfying AB = C, and the operator represen- 
tation gives AB = C. If A does not commute with B, this can never be a relation between Hermitian 
operators. So, when applying Dirac's quantization scheme, it is a good idea always to keep in mind this 
problem, which might occur in the last step. We should take care in all intermediate steps and make 
sure that they are compatible with the *-relation. 

From this point of view, the "mixed" scheme for the physical state conditions discussed above 
becomes more "natural" as well. To see why we should use two complex conjugate set of constraints in 



( 2.20 ), consider the following. When we are talking about complex constraints there is a little subtlety. 
At the classical level, we always considered real equations of the from (f> a {q,p) = 0. Now assume that 
we can combine the constraints into a set of half as many complex equations. We can do this whenever 
their number is even, but let us assume that there is a "reasonable" way to do so. Then the classical 
conditions are still the same as before, only their form has changed: they are half as many but complex. 
We can write them equivalently as 4> a = or (j)* a = 0. But this is no longer true at the quantum level, if 
we impose the conditions ( 2.1C| ). Should we impose 4> a \^) = or 4>* a \^) = 0, or both? It won't have any 



impact on expectation values, up to orders in h. Indeed, imposing only the first condition is just what 
we did when we used the mixed framework. That is because, if we have a scalar product, then (j> a \^) = 
becomes equivalent to {^\4>a = 0. What the mixed scheme actually does is to impose only half of the 
constraints. This turns out to be enough, because the physical state condition <j) a | = is always a 
complex equation. It is the quantum counterpart of the complex classical equation </> a (q,p) = 0. There 
is no need to impose real and imaginary part separately. 

In most cases, it is either possible to impose all constraints as (j) a \^) = 0, or they arrange properly 
in two sets of complex conjugate constraints, so that one of the scheme can always be applied. However, 
sometimes this final step really causes serious problems, and this seems to be the case in some approaches 
to quantum gravity. The maybe famous one in the moment is the Ashtekar programme. It turned 
out that by choosing suitable variables, the constraints to be solved become rather simple, and many 
physical states could be found. But the variables are such that during the quantization procedure 
one gets completely lost of the complex conjugation relations. The conjugation operation is realized 
non-analytically, which leads to the problem that if F can be given as a well quantum operator, F* 
will in general not have a proper operator representation, as it cannot be expanded in a power series 
of differential operators. It is clear that this makes it hard to construct the correct scalar product, 
and it is this point where the Ashtekar programme got stuck. So the most non-trivial step in Dirac's 
quantization programme is the last one: to find the scalar product on the physical state space. 



22 



3 Examples 



I will consider three examples here, all having gauge degrees of freedom, but of very different types. 
The first is the free electro-magnetic field. Here, the gauge group is rather simple, but we have to deal 
with a field theory which has an infinite dimensional phase space. This example will show that Dirac's 
programme will give us a manifestly gauge covariant quantization, in contrast to most of the "modern" 
methods, where a gauge fixing is necessary before it can be quantized. 

The second example is the relativistic particle in a background electro-magnetic field. It is the 
simples example for a system with parameter time. This means that the time coordinate that appears 
in the integral that defines the action is not the physical time, but rather some parameter that can be 
chosen arbitrarily. The possible reparametrizations of that coordinate will show up as gauge symmetries 
in the Dirac programme. 

Finally, in string theory, considered as a two dimensional field theory, we also have a parameter time, 
but in addition we have infinitely many degrees of freedom. It turns out that the resulting constraints 
can no longer be quantized such that their classical algebra is preserved, and as a consequence we have 
to use the "mixed scheme" to define the physical state space. 



The electro-magnetic field 

This is the simplest example for a non-trivial gauge field theory. Its action is given by 

I[A] = ~\Jd 4 x F^F^, F^ = d^A v - d v A^. (3.1) 

The spacetime metric is taken to be g^ v = diag(— 1, 1, 1, 1). We have to write this as an integral of 
some Lagrangian over the time. To do this, we have to choose a time coordinate, which we shall take 
to be the coordinate of Minkowski space. The index \x will be split into the component and the 
remaining spatial components i = 1,2,3. Similarly, the potential A^ splits into a time component Aq 
and the space components A{. The spatial indices can always be written as lower indices, as the metric 
for them is just the unit matrix. The Lagrangian becomes 

L^^jiV^-^, F , = V,- B ,M, F^B.A,-^. (3.2) 

Here, is the "velocity" OqA^. The configuration space is given by the set of spatial field configurations 
Au(x), and a time evolution is given by a spacetime field configuration Au(x,t), where t = x° is the 
coordinate over which the Lagrangian has to be integrated to give the action. This shows a general 
problem of the Dirac programme when applied to field theories. One has to single out a special time 
coordinate, thereby breaking the manifest Lorentz covariance of expressions like (|3,l|). At the classical 
level, there exists a generalization of the Dirac programme, the De Bonder Weyl canonical formalism, 
which does not break this symmetry and treats all spacetime coordinates similarly. Instead of a spatial 
field configuration evolving in time, it in some sense considers a field "evolving" in spacetime. The 
problem with this nice formalism is that up to now nobody has been able to construct a quantum theory 
based thereon. So, here we have to stick to this "non-relativistic" formalism, where some coordinate 
t = x° is distinguished. 

The momenta are found to be 

SVi(x) oV (x) 
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This gives a primary constraint 



01 (x) = -E (x). (3.4) 

Note that this is not just a single constraint, but a infinite set of constraints, one for each space point. 
The point x now plays the role of the index a in ( |1.43| ). This holds for the momenta as well: there is a 
momentum E fM (x) conjugate the field An(x) at each space point. For the Poisson bracket we have 

{A ti (x),E u (y)} = g tiV S 3 (x,y). (3.5) 

The Hamiltonian is easily found to be 

H = Jd^x^EiEi + lFijFij + EidiAo + uEo, (3.6) 

with one free parameter u{x) for each x. The time evolution equations are 

At = {Ai, H} = Ei + diA 0l A = {A , H} = -u, 

Ei = {Et, H} = -djFij, E = {E , H} = -d^. (3.7) 

Here it should be clear how the arguments x are to be added, so I didn't write them explicitly. The last 
equation tells us that there is a secondary constraint 

<hfr) = -9iEi(x). (3.8) 

As you can read off from the definition of the momenta, and also from the evolution equations, E{ is of 
course nothing but the electric field. There is one equation for Ei which doesn't contain time derivatives, 
namely the GauB law, which has to show up as a constraint. There are no more constraints, as 

{^ 2 ,H} = -d i d 3 F ij = (3.9) 

by antisymmetry. Especially, there are no restrictions on u. It is also immediately obvious that the 
constraints are first class, as they only depend on the electric field and therefore their bracket vanishes. 
Hence, we expect them to generate gauge transformations. Consider a general "linear combination" of 
constraints 



<t>[u] = J & A xui{x)<j)i{x) +u 2 {x)(i)2{x), (3.10) 

where u is a pair of scalar functions on space. Then, we have 

5A = {A , <j>[u}} = m, SAi = {A, 4>[u]} = d iU2 , (3.11) 

and the electric field in invariant. Clearly, this is just what we know to be a gauge transformation in 
electrodynamics. However, there is one point which should be mentioned here. If we look at the theory 
from the four dimensional point of view, a gauge transformation should be given as SA^ = d^u. So, 



actually the two parameters in(3.11) should be related by u\ = ii2- From the point of view we are 
taking here, these two are independent as we are only considering gauge transformations at a special 
moment of time. You can check that this is in agreement with our definition of gauge transformations. 
Using the time evolution ([T?]), you can achieve any transformation generated by ( |3.11 ) by going a small 
amount backwards in time, and then forward again with a differently chosen u. 
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Here we have what we already discussed quite generally. In the Hamiltonian, only the primary first 
class constraints appear with free parameters. But the secondary ones also generate gauge transforma- 
tions and so we are free to add them to the Hamiltonian, with free parameters in front. This gives the 
total Hamiltonian 



H = Jd\\E i E i + \F ij F lj + u 1 E -u 2 d l E u (3.12) 



where the Ed A term has been absorbed by redefining u 2 i— > 112 + Aq. As we concluded quite generally, 
this extended Hamiltonian no longer generates solutions to the original equations of motion (except 
if we choose 112 to be Aq). But physically we cannot distinguish them because they will always be 
gauge-equivalent to some extremum of the action. 

What are the observables? We already found that the electric field is an observable, as {Ei, 4>[u}} = 0. 
Eq ~ is also an observable, but it is a rather trivial one. The remaining observables are the magnetic 
fields Bi = TjSjjkFjk- There is no observable that depends on Aq, because it can be completely "gauged 



away" by ( 3.11] ). The energy is given as the value of H on the constraint surface, which is 

J&^lEiEi + lBiBi. (3.13) 

To quantize the field, let us choose the standard representation. The state |\&) is thereby given as a wave 
functional VP [A] of the gauge potential. Clearly, Ay, acts as a multiplication operator and the electric 
field as 

^'-^"-jBf)- (3 - 14) 

Imposing the constraints means that we have to require 

♦ , i* > -° «* fSt*) =o ' &| * >=o * di ^) =a - (3 - i5) 

The first one is trivially solved: VP must not depend on Aq. The second states that the state has to 
be a gauge- invariant functional. This is a general feature of standard gauge theories. The constraints 
are linear functions of the momenta, and if we choose the standard representation, the action of the 
constraints on the wave functional becomes a simply gauge transformation. Note, however, that this 
holds only as long as the constraints are linear in the differential operators. So, effectively the constraints 
tell us that a physical state must be a gauge-invariant functional. 

We can now apply another rather general technique to get an overview over the physical state space. 
It works as follows. Assume that we found some special physical state by explicitly solving the constraint 
equations. Then, we can act on that state with an observable, which will again give a physical state. 
As the physical state space is required to be an irreducible representation of the observable algebra, we 
can generate every physical state in this way. The result will be some kind of Fock space. Let us try it. 
Here is a special gauge-invariant wave functional, the Chern Simons form 

tf L4] = exp ( - — J d 3 x e ijk A^A^j . (3.16) 

Let us denote the corresponding state by |0). It is, up to normalization, the unique solution to the 
functional differential equation 

(Ei - iBi) I 0) = 0. (3.17) 
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The operator here is an observable, and together with its conjugate Ei + \Bi we have a complete set of 
observables. They are complex conjugate, so they might be good candidates to generate a Fock space. 
Choosing a suitable operator ordering, the Hamiltonian can also be written in a nice form as 



H= I d 3 x \{E\ + \Bi)(Ei - iBi). (3.18) 



This is exactly what we need for the typical Fock representation, as now we have -ff|0) = 0. So is |0) 
the vacuum state? Everything looks pretty nice up to now, but yet we haven't done the last step. The 
scalar product on the state space is still missing. How shall we define it? We could try 

($|¥) = JdA $*[A\&[A], (3.19) 

but a functional integral like this is really hard to deal with. In particular, we would get all the problems 
present in the path integral approach. Moreover, this product would, if it exists, be defined on the whole 
state space, not just on the physical state space, and we saw that this is in general impossible. However, 
we have something like a Fock space, so we should try to define it recursively. We normalize it by 

(0 | 0) = 1. (3.20) 

Now consider the "one photon" state 

\n) = yd 3 x^(x)(^(x)+iSi(x))|0), (3.21) 

which is physical, because the operator is an observable (its wave functional is J 2\£l l Bi an d solves 
(13151) ). We want our scalar product to preserve the complex conjugation, which means that the Her- 



mitian conjugate relation 

(E t (x) + LB^x)) 1 = (Ei{x) - iBi(x)) (3.22) 
should hold. Using this we can compute the norm of the new state 

(n\Q) = Jd 3 xd 3 y nt(x)n j (y){0\(E i (x)-[B i (x))(E j (y)+iB j (y))\0). (3.23) 

Using the commutator 

[Eilx^Bjiy)] = -ihe ijk d k 5 3 (x - y), (3.24) 



and (p7[) , this gives 



(CI | n) = 2h J d 3 xe ijk £l*i{x)djQ k {x). (3.25) 

This is not positive definite. By choosing suitably, it can become any negative number. What 
does this mean? Well, it tells us that it is not possible to define a scalar product such that the state 
|0) is normalizable and the complex conjugation relations for the observables are correctly realized as 
Hermitian conjugation relations. These were the two assumptions we made. To get a correct quantum 
theory, we have to find a different "vacuum state", i.e. another solution to ( |3.15| ), to build a Fock space 
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on. One needs to play a bit around to find the correct one that reproduces the standard Fock space of 
quantum electro-dynamics. It is 

•oM = «*(-^/^ *(*)*(-*)), (3.26) 

where Bi{k) is the Fourier transform 

Bi(k) = J d 3 x e- ik ' x Bi{x) = ie ijk kjA k (k), (3.27) 

and similar definitions will be used for all other fields. The commutators of the transformed operators 
are 

= -ift(2vr) 3 Sij 5 3 (k + I), 

[Ei(k),B s (l)] = -h(2n) 3 e ijk k k 5 3 (k + /). (3.28) 

This give the following operator for the electric field when it acts on a functional of Ai(k): 

= iajpsr < 3 - 29 > 

The Hamiltonian is also straightforwardly transformed as 

d 3 fc 



-—E^-QEiW+Bii-QBiW 



2(2?r) 
f d 3 k 

= J E*i(k)Ei(k) + Bt(k)Bi(k). (3.30) 

To find the creation an annihilation operators, we act with Ei{k) on the vacuum state flpp , which 
gives 

(Ei(k) + e ijk ^B k (kj) | 0) = 0. (3.31) 



So this is expected to be the annihilation operator. We define 



ai{k) = Ei{k) + e ijk ^-B k (k), a\(k) = E^-k) + s ijkT ±B k (-k). (3.32) 



They also form a complete set of independent observables as we can solve these equations for E^ and B k 
(except for k = 0, but B k (0) = anyway). They are conjugate to each other and the only non- vanishing 
commutator gives 

[Si(fe),ot-(0] = 27i(27r) 3 5ijk2 ~ hkj 5 3 (k - I). (3.33) 

|fc| 

The crucial property of this commutator is that the matrix appearing on the right hand side is positive 
semidefinite. It is now possible to define the scalar product. We repeat the construction from above 
with 

t d 3 k 

|«> = Jj^^i(k)a\(k)\0), (3.34) 
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Using the commutator ( 3.33 ) the norm of this vector becomes 

(n\n) = 2hj n*(k)^j(k)(k 2 s l3 - k^). (3.35) 

In contrast to ( 3.25p , this will never be negative. It follows that for all physical states generated by the 



a\ operators, we have (vl/|\l/) > 0. What remains to be shown is that the only state with zero norm is 
\*ff) = 0. The integral above becomes zero if an only if Cli(k) = kiQ,(k). Inserting this into ( 3.34j ) we get 



|0) = Jd d tcn i (k)a\(k)\Q) = J dx Q(k) fe» Ei(k) | 0) = 0, (3.36) 

because kiEi(k) is nothing but the Fourier transform of the constraint <p2{k) = ikiEi(k). The same 
holds if we replace |0) by any physical state, and therefore our scalar product is perfectly well defined. 
We also see that the function Qj appearing in the creation operator ( |3.34 ) has some "gauge freedom". It 



creates the same physical state if we replace (As) i— > Qi(k) + ki\(k) . So we might gauge fix by requiring 
ki$li(k) = 0, which means that there should be no "longitudinal modes" in f2. However, this is only 
necessary if we want to classify the physical states, say, by defining a Fock basis of the physical state 
space. Such a gauge fixing is completely unnecessary for the definition of the state space. The creation 
operators take perfectly care that we never create an unphysical "longitudinal" state, simply because 
they are observables. The gauge freedom just shows up as some ambiguity in f2: choosing different 
functions changes the operator only weakly. 

Finally, we should also check whether the a\ operators really create the correct eigenstates of the 
Hamiltonian, with positive eigenvalues. We can express H in terms of the creation and annihilation 
operators. We choose the correct operator ordering to get 

H = J^^a\{k)ai{k) => H\0) = 0. (3.37) 

Now we can compute the commutator of a creation operator with the Hamiltonian 

[H,a\(k)] = h &ijk \~ klkj 4(fc) « h \k\ a\(k), (3.38) 

\K\ 

because kjaJj(k) = kjEj(k) ~ 0. The correct commutator relation holds only weakly, but this is enough 
to get the right eigenvalue equations for the physical states. Hence, we finally arrived at the correct 
quantization of the free electro-magnetic field. Everything has been more or less straightforward, we 
just had to apply the rules of the Dirac programme. I should emphasize that, in contrast to other 
quantization schemes, our final result is completely and manifestly gauge covariant. At no point we had 
to impose a gauge fixing. All our stated are represented by wave functionals, which can be obtained by 
acting repeatedly with the differential operators corresponding to a\ on the vacuum functional ( |3.26| ), 
and all these functionals are gauge invariant. 

The problem with the wave functionals is, beside the fact that they will become rather awkward 
objects after after a while, that we do not have any idea of how to represent the scalar product. It should 
be some functional integral which becomes finite exactly for the states in our Fock space. It is quite 
obvious that then we are confronted, as already mentioned, with all the problems of the path integral 
approach. We had to integrate over gauge degrees of freedom, which requires some gauge fixing, ghosts, 
etc. So, it is much more suitable to take the Fock basis and forget the wave functionals. All we need is 
the representation of the Hamiltonian in terms of the en and a\ operators, and their commutators. This 
already defines them uniquely as ( |3.32| ). Of course, this was the way the special wave functional ( 3.26j ) 



was found: by solving the differential equation (|3.31[) , just like one does it to find the ground state of 
the harmonic oscillator in the "x-representation" . 
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The parameter time 

We shall now consider another class of systems with gauge degrees of freedom which are quite different 
from the standard ones. Their common feature is that they possess an invariance under the diffeomor- 
phism group of the underlying spacetime, which physicist often express as general covariance. Typical 
examples are the relativistic point particle, string theory, and general relativity. For the first, the gauge 
group consists of the diffeomorphisms of the real line, represented by "reparametrization" of the world 
line of the particle. The same in one dimension more holds for the string, and in gravity the gauge 
group can be considered as the diffeomorphism group of spacetime. 

As the first step in the Dirac programme is to write the action as an integral of a Lagrangian over 
time, we are confronted with the problem that the natural action for these systems is not given in 
that form. For the relativistic particle or string it is given as an integral over the worldline or world 
surface instead, and in gravity time itself becomes a dynamical object. The way out in these cases 
is to consider some "time-like" parameter, which behaves almost like a time coordinate, and perform 
the Dirac programme using this parameter time. Hence, all we need is that the action is given in the 
form of an integral of some Lagrangian over a real parameter. For the relativistic particle this can be 
chosen to be any parameter on the world line, for the string the non-compact coordinate on the world 
surface, and for gravity we need a foliation of spacetime, or equivalently there must be a special global 
coordinate r£l such that spacetime becomes a direct product of that R with some space manifold. 
In all these cases the action integral can be split into a "spatial" integral and a "time" integral, which 
is similar to the split we had to make in field theory, except that out time is not the physical time. The 
Lagrangian then takes the usual form, it becomes a function of the fields and there velocities, which are 
now defined with respect to the parameter time. 

After deriving the Lagrangian, we can simply forget that the parameter time is not the physical 
time, and perform the Dirac programme straightforwardly. The reparametrization invariance will show 
up as a gauge degree of freedom, i.e. as a set of first class constraints. These will take care that the 
state as well as the observables become gauge-invariant objects, which are independent of the chosen 
time coordinate. Moreover, all these theories have another common feature which reflects the fact that 
the "canonical time" is not the physical time: the Hamiltonian will always vanish on the constraint 
surface. This has two consequences. It means that there is no energy "conjugate" to the parameter 
time, which would be unphysical too. It also means that the total Hamiltonian can be written as a 
linear combination of constraints, hence as a generator of a gauge transformation. This again means 
that time evolution becomes a pure gauge transformation, which is what we have to expect because 
any type of time reparametrizations belong to the gauge group. From the physical point of view, there 
will be no time evolution at all. How can this be? We know that the relativistic particle or string does 
evolve in time, and in gravity spacetime is also dynamic. But we have to keep apart the two things 
called time, the physical time and the parameter time, and if we do it carefully, we will see that there 
is no contradiction. 

The relativistic point particle 

All this is most simply explained if we consider the relativistic point particle as an example. The action 
of a particle of mass m moving in flat Minkowski space is given by the length of the path. To make it 
not too simple, let us couple the particle to an external electro-magnetic field, then the action becomes 




(3.39) 
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where fi, v = 0,1,2,3, and the metric again g^ u = diag(— 1, 1,1,1). The extrema of this action are of 
course the timelike straight lines, but on these lines the parameter r can be chosen quite arbitrary. We 
can replace it by any function r i— > /(r) and we still get the same physical solution. The gauge group is 
therefore expected to be the diffeomorphism group of R. Let us apply the method we have developed. 
The Lagrangian is 



For the momenta, we find 



L[q,v]=m y f=vi%-ev^A lt (q). (3.40) 



dL mv,, 

Pn = W j=- I — § — ~ e Aft, 3.41 



and this gives a primary constraint 

^i^ + e^K/ + eA») + m 2 ). (3.42) 

Now let us consider the Hamiltonian. If our system is gauge-invariant under arbitrary rescaling of the 
time coordinate, then the Lagrangian must be a homogeneous function of the velocities. This can be 
see as follows. If the action is invariant under rescaling r i— > t', then we must have 

dr L(q, v) = dr' L(q, v') = dr' L(q, ^ v), (3.43) 

because the velocities scale as vdr = v' dr'. So L must be homogeneous in v, which is obviously true 
for our example. The function f = p ■ v — L, which has to be extremized to give the Hamiltonian, is 
homogeneous as well. For such a function we have v ■ df jdv = /. This tells us that whenever it has an 
extremum, the value of that extremum is zero. So the Hamiltonian will always vanish on the constraint 
surface. As already mentioned, this is in agreement with the physical statement that there can be no 
conserved charge, i.e. no "energy", conjugate to an unphysical time. The total Hamiltonian is therefore 
just a "linear combination" of the constraint 

H = u^ = \u{{p^ + eA^ + eA^) + m 2 ), (3.44) 

with a free parameter u. There are no more constraints as tp clearly commutes with H, and for the 
same reason it is first class. To derive the evolution equations, we have to take into account that A^ 
depends on q: 

e = {q»,H}nu(p» + eA>*), p lt = { Pli ,H}^-u(p v + eA v )ed )l A v . (3.45) 

You can check that, if p is eliminated from these equations, the potential enters the resulting second 
order equation for q only via the field strength. 

Let us take A^ = for the moment. What are the gauge transformations generated by tp? The 
brackets are 



{q»M=p^ {p",^} = 0. (3.46) 

This is a displacement in the configuration variable q along the direction of p. Note that on the 
constraint surface (which is the direct product of Minkowski space with the two parts of the mass 
shell) p is always a nonzero timelike vector. We find that two states (q,p) and (q',p r ) are physically 
equivalent, if p = p' and q — q' oc p. Hence, if we make a gauge transformation, the particle will 
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appear somewhere else in spacetime, at a place where is "was" or "will be" at an earlier or later "time" , 
but it will have the same momentum. If we look at the class of all states that are equivalent to some 
given state, we find that it is a straight line in phase space: the value of p lies on the mass shell, is 
constant, and gives the direction of the line in the configuration space, which is Minkowski space. So, 
the world line in phase space that passes through a given state (q,p) is the equivalence class of that 
state. The same holds for the particle in a background field. The states that are equivalent to some 
initial state (q,p) are exactly the points on the world line of a particle moving in the background field 
and passing though the event q with momentum p. Remember that we defined physical states to be 
these gauge-equivalence classes, so the physical states in our case are the world lines in phase space. 

Let us consider this from a different point of view and look at the time evolution of a state on the 
constraint surface. We said that we are free to choose the parameter u in the Hamiltonian as we like. 
So let us take u = 0. Then there is no time evolution at all. Given an initial state (qo,Po), at r = 0, 
integrating the Hamilton equations gives q(r) = qo and p(r) = Pq. This is a quite funny solution, the 
particle doesn't move through spacetime, it just sits at one event. Seems that it doesn't make any sense 
to consider such solutions. But you can convince yourself that this is a solution that extremizes the 
action. We could avoid such silly things by imposing boundary conditions on the paths like q° — ► ±00 
for r — ► ±00. But that doesn't make sense in the Hamiltonian picture. Here, were we are only dealing 
with states that evolve "locally" in time and not with the whole time evolution. We can generate even 
more silly solutions: by taking u to be an oscillating function, the particle will move up and down 
along the world line, or we can make it running backwards in time (don't mix this up with replacing 
p 1 — ► —p which is not a gauge transformations and not even a symmetry if ^ 0; we'll come to this 
in a moment). 

But we have to remember that all these "evolutions" are just gauge transformations. There will 
never be any physical time evolution, as all the states we pass through will always correspond to the 
same physical state, the same equivalence class of states. Nevertheless, somehow the particle should 
move through spacetime and we should be able see it moving. In other words, we should be able to 
observe the particle moving through spacetime. We have to remember what we said about observables. 
An observable is a phase space function that is gauge- invariant, or has weakly vanishing brackets with 
the constraints. For our particle, we must have {O, tp} ~ 0. Without the exterior field, the p^ are 
observables, but they are not all independent, because we have p 2 ~ m 2 . So they make up three 
independent real observables Pi = pi, i = 1, 2, 3, and one sign C which gives p° = Cy/piPi + m 2 and 
tells us on which part of the mass shell we are. Of course, C is an observable even if the field is switched 
on, and by inspecting the equations of motion, we find that it is the sign of the charge of the particle. 
At this point it is really important to note that the transformation C 1— > — C has nothing to do with and 
cannot be compensated by choosing another gauge, i.e. changing the value of u. It really changes the 
physical state, as the particle is now oppositely charged and moves on a totally different line, even if it 
is given the same 3-momentum at the same spacetime point initially. And note also that this "double" 
phase space occurs already at the classical level. The action ( 3.39 ) has two types of solutions: electrons 
and positrons, which behave different if there is an external field. This is not a quantum-effect, as is 
sometimes stated. The discrete observable C is a result of the classical Hamiltonian formalism. 

Again without the field, the observables and C are also conserved charges as we should expect. 
But what was the difference between observables and conserved charges? In addition to the first class 
constraints, conserved charges must have vanishing brackets with the Hamiltonian. But this doesn't 
give anything new here: the Hamiltonian consists of nothing but the constraint. Here we found another 
general property of systems with parameter time. Every observable automatically becomes a conserved 
charge. This is not really surprising, because if the time parameter r is not the physical time and 
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can be reparametrized arbitrarily, we should expect that a quantity that has a physical meaning must 
not depend on r. And "conserved" in this context means with respect to r and not with respect to 
the physical time, which in our case may be taken to be the phase space function q° (which is not an 
observable!). 

Beside the momenta, which are also "conserved" with respect to the physical time (whatever that 
in this context might mean), are there other observables? Let us think in a more physical way: what 
else can we measure? For example, where in space is the particle at the physical time q° = i? Given a 
state (q,p), how can we find the answer, if not accidentally q° = t. In that case, the answer is of course 
that the particle is at the space point with coordinates qi. Here is the general answer: make a gauge 
transformation from (q,p) to (q',p') such that q'° = t. By definition, this does not change any physical 
quantity. Then q[ gives the space point we are looking for. For = 0, it is not too difficult to write 
down an observable for this measurement: 

Qi{t) ~qi + ^P-Pi- (3-47) 

This gives a different phase space function for each value of t, which for any state on the constraint 
surface gives the space point where the particle is, was, or will be when q° = t. It is well defined because 
p° 7^ on the surface. You can check that its bracket with the constraint vanishes, so it is a conserved 
charge. We can also switch to another observer and ask for the position he will find at some time t' in 



his reference frame. Clearly, an explicit expression for this can be given by Lorentz-transforming (3.47) 
suitably, or more geometrically by finding the intersection point of the world line with some spacelike 
hypersurface. In principle, we can express everything which is a property of the world line itself and 
not of its parametrization in form of an observable or conserved charge. Note again that t has nothing 
to do with the parameter r and "conserved" does not mean that Qi(t) does not depend on t. If we set 
Qi = Qi(0), we get the following complete set of independent observables: 

C = sign(p°), Pi= Pi , Q i = qi -^ Vi . (3.48) 

Po 

The more familiar ones are functions thereof, e.g. the angular momenta 



Jfiu = q^Pu - qvPn => Joi = CQiVPiPi + m 2 , Jij = QiPj - QjPi, (3.49) 

which are not independent. We see that to extract physical properties of a state, we have to refer to the 
physical states, i.e. the equivalence classes under gauge transformations, which are the world lines. If 
we do this, we can totally ignore any evolution generated by the Hamiltonian, as there is none. Instead, 
to see a physical evolution, we must refer to the physical time, which in this case can easily be identified 
with a special phase space function q°. We can then ask for correlations between this physical time and 
other phase space functions, and it is this what gives us observables. 

In principle, we can do the same if there is a background field, things just get technically more 
involved. What we still have is the following. For every equivalence class and every physical time t 
there is exactly one representative for which q° = t. This is because the world line is always timelike. 
Now consider an arbitrary phase space function F, not necessarily an observable. For a given physical 
state, the value of F will depend on the representative. But there is an observable Ft which is defined 
to be the value of F at that representative where q = t. Clearly, the physical interpretation of that 
observable is the value of F at the time t. By definition Ft is constant along the world line, so it will 
have a vanishing bracket with the constraint. If we take F = qi, which is not an observable, then Ft will 
be the space point where the particle is at q° = t, and this is the observable Qi(t) as defined above. Of 
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course, to give an explicit phase space functions in general one needs to solve the equations of motion. 
This is what we could do for the field- free case above, which gave us the rather simple expression ( 3.47| ) 
for Qi(t). 

Hence, we found that the main problem for systems with parameter time is the construction of 
observables. It is somehow clear from the physical point of view, as all we have to do is to identify the 
physical parameters describing the state, but it is rather complicated to give explicit representations 
of these functions in terms of phase space function. In more complicated theories like gravity this is 
indeed one of the major problems. But the ansatz is very similar: one has to identify some phase 
space function which describes the physical time, and then consider correlations with other phase space 
functions. What makes this a bit harder than for our simple system is that in gravity there is no 
"global" physical time. So one either has to deal with a local time, which can be described by some 
kind of "clock field" and refer to this, or one can use "cosmological clocks", which in principle is the 
translation of "boundary conditions" from mathematical into physical language. 

Let us now quantize the relativistic particle. We take wave functions ^(q) to represent states |vl/), 
and the operators are 

% = q^(q), p„ * (g) = (q). (3.50) 

The physical states are the solutions of the Klein Gordon equation (with = from now on) 



/ d d m 2 



V>l*> = & (— — -Ta-Wg) = 0. (3.51) 



We know the complete set of solutions, and we can most conveniently take the p eigenstates as a basis 
for the physical state space. For each three- vector k and an additional sign c = ±1 we have a state 
\k,c) which is represented by the wave functional 

^k,c(q) = exp (ikiqi — icujq ), with u = y kiki + (rn/H) 2 . (3.52) 

They provide the eigensystem of the observables Pi and C, which form a maximally commuting subset 
of (HI): 

Pi | k, c) = h ki | k, c) , C | k, c) = c | k, c) . (3.53) 

There is no time evolution at the quantum level, for the same reason as above: a physical state is a 
gauge invariant object, and time evolution is nothing but a gauge transformation. Here it is even more 
apparent that we really need the observables to extract physical information. At the classical level we 
could answer the question "where is the particle at physical time q° = i?" by looking at the equivalence 
classes, the world lines of the particle. We gave an answer by choosing a special representative (q,p) 
with q° = t. We cannot do this here any more. Quantum physical states are gauge invariant objects 
by themselves, we cannot choose a particular representation. Moreover, we cannot reinterpret the 
wave function on spacetime as a spatial wave function ^(qi) which evolves in time q°, as is sometimes 
suggested. This causes lots of problems, as for example there is no "conservation of probability" , so one 
cannot define suitable expectations values etc. The only way out usually suggested is that one has to 
consider pair creation an annihilation, and go over to multi-particle physics. 

But this is not true. It is sufficient to note that the Klein Gordon equation is not a "relativistic 
Schrodinger equation" . It is not a time evolution equation for the state but a constraint. We shall see 
that if we keep this in mind and find a suitable scalar product, we will not have any problems. We can 
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quantize one single relativistic point particle. The only way to extract physical information is via the 
expectation values of observables.. 



(Qi(t)) = (*\Qi(t)\y) (3.54) 

will gives us the expected position of the particle at physical time t, as seen by the observer whose 
restframe is the one we are working in. A similar expression can of course be given for any observer by 
replacing the operator with a Lorentz transformed one, just like in the classical case.0 

But what is the scalar product? Clearly we want that the observables ( |3.48| ) become Hermitian, 
and this will fix the product up to normalization. The momenta Pi and the charge C become Hermitian 
if their eigenstates are orthogonal. Hence, we must have 

(k, c 1 1, J) = (2vr) 3 6 CiC , 5 3 (k - I) f(k), (3.55) 

for some positive function f(k). To fix this function, we have to require Qi to be Hermitian. It was given 
by Qi = q% — (po) qoPi- This causes a problem, as it is not clear in which order the non-commuting 
operators^] (pb) an d §b are to be put. Let us choose the most general possibility, which is 

Qi = % - (1 - a) q (po) -1 K - a (po)" 1 q Pi 

= Qi - Qo (Po) _1 K + (Po)~ 2 Pi- (3.56) 

Here we used the commutator [(po)^ 1 ,^] = —^{po)~ 2 , showing explicitly that the ordering ambiguity 
corresponds to an 0(H) term. As this is an observable (for any value of a), its action on a physical state 
should be a physical state again. In fact, one finds that 

- d 

Qi \k,c) = — i— — I k, c) + ia kiuo' 2 \k,c). (3.57) 
oh 

This gives the following matrix element for Qi\ 

(I, c\Qi\k,c) = -i(2vr) 3 ( di(5(k - I) f(k)) + S(k -I) a hco" 2 /(Jfe)) . (3.58) 
It becomes Hermitian if 

d i f{k) = 2ak i u- 2 f(k) => f{k)=uj 2a . (3.59) 

This shows how requiring the real observables to be Hermitian gives a relations between the represen- 
tation map ~~ and the scalar product, as the parameter a is still free, and for any value we choose we 
will find Qi to be Hermitian. The same, of course, also holds for the operators Qi(t) for the place of the 
particle at q° = t, as they are given as a sum of Qi and another Hermitian operator. But nevertheless 
there is a preferred choice for the product that fixes a. Consider the angular momentum operators 



introduced in ( 3.49 ). As Qi does not commute with Pj, there is an ordering ambiguity in the definition 
of Joi- If we want it to be Hermitian, there is only one possible ordering, which is the symmetrized 
product. We can also turn the argument around and consider the classical equation Qi = (po) _1 ^iO- 
This can only become a relation between Hermitian operators if we take the symmetrized product in 



8 Here we are ignoring the problem that in relativity we cannot really make such kind of measurements, as they would 
involve a whole spacelike hypersurface as "measuring device" . But you can think of equivalent "local" measurements where 
the observables are given as projectors onto some eigenstates of Qi(t). 

9 Note that (po) _1 is a well defined operator on the physical state space 



34 



the corresponding quantum relation 2Qi = (po) _1 ^iO + JioiPo) 1 - Now we impose an additional re- 
quirement. We want that the angular momentum is represented in the standard form J^o = %Po — QoPi, 
where no ordering ambiguities occur. You can easily see that this fixes a = 1/2. 

Note, however, that this is just a convenient choice, it does not have any impact on the physical 
properties of our system, it just provides the nicest possible representation. Otherwise we would have 
to work with slightly unconventional angular momentum operators. We can also see that a = 1/2 is a 
"canonical" choice, because then the product becomes 

(k, c 1 1, c') = (2vr) 3 <5 C;C / 5 3 (k - I) u(k), (3.60) 

and this is the unique Lorentz invariant delta function on the mass shell. Clearly, it would have been 
much easier to require the angular momentum operators to be Hermitian straight away, which more or 
less immediately tells us that the delta function in the product must be Lorentz invariant. But for more 
complicated theories it might not be so obvious that there is such a distinguished representation. And 
it is important to note that there are some ambiguities in the scalar product and the representation 
which cannot be fixed by physical arguments. 



String theory 

The same technique can be applied to string theory. It is very much like the point particle, except that 
there is one dimension more. The action of a string moving in a flat Minkowski space is given by the 
area of its world surface. The latter is given as a function q^(a,r) of two parameters. We consider it 
as a two dimensional field theory. The "time coordinate" will be r, and a is considered as a spatial 
coordinate, which runs from to 2ir. We choose the closed string here, i.e. (? M (0, r) = q tJ, (2-K,r). The 
action can then be written as 

I[q\ = j drdcr d^d^ d a q v d T q v - d^d^ d T q v d T q u . (3.61) 

The expression under the square root is the negative determinant of the metric on the world surface 
that is induced by the embedding of the world surface into the background space, so this will give the 
total area covered by the surface. By splitting off the "time" integral, we get the Lagrangian 

L[q,v}= J da yjq^vi* q' v v» - v^v» q' u q'\ (3.62) 

where the prime denotes the derivative with respect to a. Similar to the relativistic particle, this is 
homogeneous of degree one in the velocities, showing that the time r is an unphysical parameter only. 
For the momenta we find 

SL q^ q' v v v - Vj^ qlq' v . . 



and the Poisson brackets are 



{q^),Pu{p)} = g»vS(o- - p). (3.64) 
And we get the following set of primary constraints: 

<M<7) = b^)p^(o-) + =p„(<7)g'». (3.65) 
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We will shortly see that these are first class and that there are no secondary constraints, but before 
showing this let us briefly look at the gauge transformations they generate, as this is typical for every 
general covariant theory. The same type of constraints are found in gravity as well. We can define the 
"linear combinations" 



Let us look at the second constraint first. The transformations on the right hand side are exactly 
those corresponding to an infinitesimal shift a i— ► a + v , where v is some "vector field" on the string. 
Thereby, the coordinate behaves like a scalar, and the momentum like a density of weight one. The 
gauge symmetries generated by the second constraint are the diffeomorphisms of the spatial coordinate 
of the string. A similar diffeomorphism constraint is found in all general covariant theories. In gravity 
we have three of them at each space point, generating the diffeomorphism group of the spatial manifold. 
We didn't have this for the point particle because there the "spatial manifold" was just a point. It 
is quite easy to give the full set of phase space functions that weakly commute with that constraint. 
They are exactly those which are independent of the special parametrization a. This is very similar to 
the electro-magnetic field, and these kinematical constraints are in general easy to solve. We just have 
to identify the invariants of a rather simple gauge group. Using the standard representation, this also 
holds for the quantum theory. The wave functional can be any invariant that does not depend on the 
momenta. 

The other constraint, however, behaves quite differently. It looks similar and the transformations 
generated are also similar to those of the point particle. It generates the diffeomorphisms of the r coor- 
dinate, but because this is our canonical time coordinate, these diffeomorphisms are dynamical. They 
are not realized by simply replacing r i— > r + u, which doesn't make sense in the Hamiltonian picture. 
Instead, the contraint generates those transformation on the phase space that formally corresponds to 
the time evolution. It replaces the (non-existing) time evolution generated by the Hamiltonian, and is 
therefore also called Hamiltonian constraint or dynamical constraint. We had the same for the point 
particle: the constraint generated a gauge transformation which effectively looks like a time evolution. 
Here the situation is just a bit more complicated as we do not only have one dynamical constraint, 
but one for each a. The typical structure of the dynamical constraint is the quadratic term in the mo- 
menta, which we also had for the point particle and which also appears in gravity, where the (quantized) 
dynamical constraint is the famous Wheeler Be Witt equation. Because of this quadratic term, it no 
longer generates gauge transformation on the wave functional, like the kinematical constraints did. So 
it is much harder to solve. It contains the whole infomation about the dynamics of the system, as there 
is no Hamiltonian. 

Hence, we find that generally in systems with parameter time the dynamics is encoded in the con- 
straints and not in the energy function, which always vanishes. These dynamical constraints are typically 
quadratic in the momenta, like the energy for unconstrained systems. For field theories like strings or 
gravity this causes another problem. In the standard representation the square of the momentum oper- 
ator will not be well defined, as it becomes a functional differential operator. This problem was present 
in electro-dynamics as well but there it shows up only in the Hamiltonian and not in the constraints, 
so we could simply ignore it until we finally "automatically" found an expression for the Hamiltonian 




(3.66) 



The gauge transformations generated by them arc 



{q»,<j)i[u]} = up^, 



fc^iM} = d a {ud a q^), 



(3.67) 
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that was well defined on the physical state space.f^] This of course leads to the well known problems of 
quantum field theory, we need a regularization etc. All these typical problems will now already show 
up in the constraint algebra. 

So before quantizing the string we should think about a regularization, as it is quite obvious that 
there is no well-defined quantum operator for the constraint (f)\, which contains the square of the 
momentum. The most suitable regularization is to Fourier transform the cr-coordinate, which is compact 
and therefore we can replace integrals by sums. Then we can take the inifinit sums quite naturally as 
the "regulator — ► 0" limits. But before we do this, let us first write the constraints in a more convenient 
way. And, of course, we still have to prove that there are no secondary constraints. As the Hamiltonian 
vanishes weakly, all we have to do is to check whether the brackets of the primary constraints with 
themselves already vanish weakly. We can write the constraints more symmetrically by introducing 

a M = \{Vp, + fyu = \ (Pn ~ 3m) ( 3 - 68 ) 
as auxilliary variables. Their brackets are 

{a ll (a),a u (p)} = \g^ v 5'{a - p), 

{b fi (a)Mp)} = -b^S'(a-p), {a^a),b u (p)} = 0. (3.69) 

You can think of these two sets of variables as describing "right" and "left" moving modes on the string 
world surface. They form an almost complete set of variables. They are not completely independent, 
as 

P^ = J da Pfl (a) = 2 [da a p (a) = 2 J da b^a) (3.70) 

is the total momentum, and the only quantity that is missing in these variables is the center of mass of 
the string 

Q» = ^Jda q^a). (3.71) 

But this does not show up in the constraints, so that they can be combined into two simple functions 
of the a's and b's: 



Wi + h) = aaa", <f>- = h(<l>i-<h) = buV i . (3.72) 



For them, we find 



{<M<t), <Mp)} = S'(a - P ) {</>+(*) + Mp)), 

{^(a),^(p)} = -5'(a - p) (Ma) + 0_(p)), (3.73) 

and the mixed bracket vanishes. So, we don't get anything new. We have two separate algebras of first 
class constraints. A more familiar form of the algebra is obtained by Fourier transforming everything 
which is a function of a. For all such functions we define 

/(A;) = J da e-' lk ° f(a) & f(a) = -L £ e i/cCT f(k), (3.74) 



10r 



The functional derivative operator inside the integral (3.37) is not well defined when acting on a general non-physical 



functional of the gauge potential, try it with the Chern Simons form! Note that the problem is not that the integral doesn't 
converge, the operator itself is ill-defined. 
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where k is an integer. The k = modes of the canonical variables then become the center of mass and 
the total momentum 

^ = ^T' ^=^(0) = 2a M (0)=2 6 M (0) (3.75) 

The basic Poisson brackets are 

{q^(k),p„(l)} = 2tt g fMU 5 k+ i => {Qp,Pv} = 9iu,. (3.76) 

for the canonical variables, and 

{a^(k),a u (l)} = mkg^ u 5 k+ u {b^(k),b u (l)} = -in k g^ u 5 k+u (3.77) 

where 5 k = 1 for k = and otherwise. For the transformed constraints we now get the well known 
Virasoro algebra 

{<P + (k),Ml)} = Hk-l)Mk + l), 

4>-{l)} = -i(k- I) ^-{k + I). (3.78) 

To quantize the string, we choose the standard representation again. We represent a quantum state by 
a wave functional $ [q] , and the basic operators are 

q^k) *[q] = qft (k)*[q], p^k) *[q] = -2mH — — [q]. (3.79) 

The operators for the a's and b's are straightforwardly obtained by using the transformed of ( |3.68j ): 

a^k) = \{p^k)+ikq^k)), %(k) = ±{p fl (k)-ikq fl (k)). (3.80) 
Finally, the Fourier transform of the constraints gives the following quantum representation: 

^+( fc ) = ^E M^(k-l), $-(k) = ± y £b fl (l)P(k-l). (3.81) 

If we look at the products of operators appearing therein, we find that for fc / we always multiply 
commuting operators, so there is no ambiguity when quantizing those. However, if k = 0, each term in 
the sum consists of non-commuting operators, and it is not clear in which order they have to appear. 
Let us rewrite the relevant constraint as 

^+(o) = ^^ + ^-E a,(0^H) + ^E 2 M H)8"(0. ( 3 - 82 ) 

l>0 Z>0 

and similarly for (fi_(0). I just wrote the sum over the positive and negative Vs separately, and replaced 
a u(0) by the total momentum. In this form, there will never be a solution to that constraint, simply 
because the sums will never converge both when acting on some state. If the first converges when 
applied to a state we must have 

0,^(1) 2 M H) | ¥) -► for Z oo. (3.83) 

Using the commutator of the a's this implies 

0,^-1)0,^(1)^) -ttTiDI\-V) ^0 for I -»• oo, (3.84) 



38 



where D is the dimension of the target spacetime. The second term will never converge, and so the 
second sum above cannot converge either. We have to reorder the operators such that both sums can 
converge. The condition for this is that almost all terms appear in the same order in both sums. But 
which order shall we choose? To see that there is a canonical choice, consider the following commutators 
of complex conjugate operators 

[a fJ ,(-k),a l/ (k)]=Tvhkg fiV , [b^{k),b v {-k)} =-irhkg^ u . (3.85) 

They behave like annihilation and creation operators. For them, this was something we also found 
for the electro-magnetic field, it is important that the commutator of an annihilation operator with 
a creation operator is positive. Otherwise we cannot define a proper scalar product. The Lorentzian 
signature of the metric is a bit problematic here, but let us just ignore this for a moment and assume 
we are in a Euclidian space. Then we must conclude that, if we want to build up a Fock space using 
these operators, we must take a^{—k) and b^k), for k > 0, to be the annihilation operators. But then 
the ordering for the constraints is fixed. Almost all terms must be such that the annihilation operator 
acts first, as otherwise there is no chance for the vacuum to be a physical state. We are however free to 
reorder finitely many term in the sums. Whenever we do that, we pick up some constant from ( 3,85| ), 



so we can summarize all these extra terms into a single additive normal order constant. The final result 
is 

+ (O) = -^P> + ±£ a^a^-^ + HN, 
871 71 fro 

0_(O) = ^-P^P^ + -V %(-l)P(l) + hN. (3.86) 

In principle, N can be different for the two constraints, but we will shortly see that we are forced 
to take the same for both. Now we can hope to solve the constraints. But still we are faced with 
another problem. We don't know yet whether the classical constraint algebra ( |3.78| ) is preserved by the 
quantized constraints. If we compute the commutator algebra of the constraints, this is what we get: 

[£+(*;), Ml)} = —h(k — I) Mk + I) - h 2 5 k+l (± D (k 3 - k) - 2 k N), 

[$-{k),$-(l)] = h(k-l)$-(k + l) + h 2 5 k+l (± D(k 3 - k) -2k N). (3.87) 

Hence, the algebra doesn't close any more, and the freedom to choose N is not sufficient to render it 
closed. This means that we cannot impose the conditions (j) a \^) = to get the physical state space. 
This time, we are forced to use the "mixed scheme", and it is indeed possible, as we can arrange the 
constraints into two complex conjugate sets, each of which forms a closed algebra. Again we are not 
free to choose these sets, if we want the correct behavior of the annihilation and creation operators. 
What we want is that a state with the property 

a^k) | 0) = for k < 0, b^k) | 0) = for k > (3.88) 



becomes physical, as this would be the vacuum. By looking at the sums in ( 3.8l| ), we find that such a 



state satisfies (j) + (k)\0) = for k < and <f)-(k)\0) = for k > 0, because in all terms in the sums there 
will be at least one with a negative argument, or one b^ with a positive argument, respectively, so 
each term will vanish separately when acting on |0). Hence, we define the physical state condition to 
be 

+ (jfc)|*) = O for £;<0, |*) = for k > 0. (3.89) 
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The remaining constraints are the complex conjugate ones, except for k = 0, where the constraints are 
real. You can see from ( |3.87p that the two subsets each form a closed algebra, even if we include the 
k = ones into both subsets. 

It is not too difficult to find a solution to these equations in form of a wave functional. What we have 



to do is to solve ( 3.88 ), which implies that ( 3.89 ) holds for k ^ 0, and then solve the k = constraints. 



Using the definition of the a and b operators, ( |3.88[) can be rewritten as 

p^k) | 0) = i \k\ q{k) | 0) for k ^ 0. (3.90) 

The general solution is 

9 [q] = exp(-^r£N q^k) q»(-kj) > (3-91) 

k 

with some wave function &(Q) that depends on the center of mass variable = q fl (0)/2ir. When 
acting with the (ft±(0) on this functional, all terms in the sums vanish, and we are left with the Klein 
Gordon equation 

(P^P 11 + 87thN)<£(Q) = 0. (3.92) 

Here we need that the two normal order constants are equal, as otherwise we would get a contradiction. 

The state given can be interpreted as the ground state of the string's internal degrees of freedom. 
In this ground state the string behaves like a point particle decribed by the wave function &(Q), which 
is completely analogous to the relativistic point particle in the last section. However, its mass depends 
on the choice we have made for the normal order constant, so effectively it depends on the operator 
ordering in the constraints. The explicit appearance of Planck's constant in m? = 8-kHN means that 
this is a real quantum effect. With the scalar product we have for the point particle, we can make an 
ansatz for the scalar product on the string state space. The norm of the ground states above are simply 
defined to be those of the corresponding state of the point particle. If we than act on the state with the 
creation operators a^k) and b^—k), with k > 0, to build up a Fock space, the scalar product can be 
computed by exploiting the commutation relations. 

Will this product be positive definite? Remember that we had a problem with the signature of 
g^y in the commutator relations ( |3.85| ), For example, we find that the norm of v^a^k)^) is irhkv^v^ 
times the norm of \*&), which becomes negative if the vector v^ 1 is timelike. However, here is one crucial 
difference to the electo-magnetic field. If |^) was a physical state, the state we just created is not 
physical, as a^(k) is not an observable. To create physical states, we have to act on the vacuum states 
with observables, and to find them is of course a non-trivial task. To quantize them is even more 
difficult, and it is this which leads to the famous consistency conditions of string theory, so for example 
to the critical dimension, which is D = 26 for this model. It also fixes N to be some negative number, 
which means that the ground state of the string is a point particle with negative mass squared. But 
note that this only affects the observable algebra. The physical state space is perfectly well defined 
outside the critical dimension, and everything is again manifestly gauge invariant, as we never had to 
impose gauge conditions. Especially, we did not need any ghosts or whatsoever to quantize the string. 
However, as already mentioned, the real problems show up when one tries to reproduce the classical 
observable algebra. 
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